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O© Methods are disclosed for producing proteins 
^having biological activity for blood coagulation me- 
UJdiated by Factor Vila or Factor IX. The proteins are 
produced by mammalian host cells which have been 
stably transfected with a DNA construct containing a 



nucleotide sequence which codes at least partially 
for either Factor VII or Factor IX. The nucleotide 
sequence comprises a first nucleotide sequence en- 
coding a calcium binding domain, joined to a second 
nucleotide sequence positioned downstream of the 
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first sequence. In particular, the first nucleotide se- 
quence may be derived from a genomic clone or 
cDNA clone of Factor VII. The second sequence 
encodes a catalytic domain for the serine protease 
activity of either Factor VIIA or Factor IX. The joined 
sequences code for proteins having substantially the 
same biological activity for blood coagulation as 
either Factor Vila or Factor IX> 
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EXPRESSION OF FACTOR VII AND IX ACTIVITIES IN MAMMALIAN CELLS 



Technical Field 

The present invention relates to blood coagula- 
tion factors in general, and more specifically, to the 
expression of proteins having biological activity for 
blood coagulation. 

Background Art 

Blood coagulation is a process consisting of a 
complex interaction of various blood components 
or factors which eventually gives rise to a fibrin 
clot. Generally, the blood components which par- 
ticipate in what has been referred to as the coagu- 
lation "cascade" are proenzymes or zymogens, 
enzymatically inactive proteins which are converted 
to proteolytic enzymes by the action of an activa- 
tor, itself an activated clotting factor. Coagulation 
factors which have undergone such a conversion 
are generally referred to as"activated factors," and 
are designated by the addition of a lower case 
postscript "a" (e.g., Vila). 

There are two separate systems which can 
promote blood clotting and thereby participate in 
normal haemostasis. These systems have been 
referred to as the intrinsic and the extrinsic coagu- 
lation pathways. The intrinsic pathway refers to 
those reactions which lead to thrombin formation 
through utilization of factors present only in plasma. 
An intermediate event in the intrinsic pathway is 
the activation of Factor IX to Factor IXa, a reaction 
catalyzed by Factor Xla and calcium ions. Factor 
IXa then participates in the activation of Factor X in 
the presence of Factor Villa, phospholipid and cal- 
cium ions. The extrinsic pathway involves plasma 
factors as well as components present in tissue 
extracts. Factor VII, one of the proenzymes re- 
ferred to above, participates in the extrinsic path- 
way of blood coagulation by converting (upon its 
activation to Vila) Factor X to Xa in the presence of 
tissue factor and calcium ions. Factor Xa in turn 
then converts prothrombin to thrombin in the pres- 
ence of Factor Va, calcium ions and phospholipid. 
Because the activation of Factor X to Factor Xa is 
an event shared by both the intrinsic and extrinsic 
pathways, Factor Vila can be used for the treat- 
ment of patients with deficiencies or inhibitors of 
Factor VIII (Thomas, U. S. Patent 4,382,083). There 
is also some evidence to suggest that Factor Vila 
may participate in the intrinsic pathway as well (Zur 
and Nemerson, J. Biol. Chem. 253 : 2203-2209, 
1978) by playing a role in the activation of Factor 
IX. 
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Experimental analysis has revealed that human 
Factor VII is a single-chain glycoprotein with a 
molecular weight of approximately 50,000 daltons. 
In this form, the factor circulates in the blood as an 
inactive zymogen. Activation of Factor VII to Vila 
may be catalyzed by several different plasma prot- 
eases, such as Factor Xlla. Activation of Factor VII 
results in the formation of two polypeptide chains, 
a heavy chain (M r = 28,000) and a light chain (M r 
= 17,000), held together by at least one disulfide 
bond. Factor VII may also be activated to Vila in 
vitro , for example, by the method disclosed by 
Thomas in U.S. patent No. 4 t 456,591 . 

Factor IX circulates in the blood as a single- 
chain precursor of molecular weight 57,000 and is 
converted to an active serine protease (Factor IXa) 
upon cleavage by Factor Xla in the presence of 
Factor VIII. Factor IXa consists of a light chain and 
a heavy chain of molecular weights 16,000 and 
29,000, respectively. 

Current treatment practices for patients having 
coagulation disorders (e.g., deficiencies of Factor 
VIII and IX) generally involve replacement therapy 
with cryoprecipitate or other fractions of human 
plasma containing enriched levels of a particular 
factor. These preparations have heretofore been 
obtained from pooled human plasma, although the 
preparation of cryoprecipitates requires the use of 
a relatively large amount of human plasma as start- 
ing material. 

Therapeutic uses of Factor VII exist in the 
treatment of individuals exhibiting a deficiency in 
Factor VII, as well as Factor VIII and Factor IX 
deficient populations, and individuals with Von Wil- 
lebrand's disease. More specifically, individuals re- 
ceiving Factors VIII and IX in replacement therapy 
frequently develop antibodies to these proteins. 
Continuing treatment is exceedingly difficult be- 
cause of the presence of these antibodies. Patients 
experiencing this problem are normally treated with 
an activated prothrombin complex known to consist 
of a mixture of active and inactive clotting en- 
zymes, including Factor Vila. Further, recent stud- 
ies indicate that small amounts (40-50 microg- 
rams) of injected Factor Vila are effective in con- 
trolling serious on-going bleeding episodes in Fac- 
tor VIII deficient patients who have high levels of 
antibody in their blood (Hedner and Kisiel, ± Clin. 
Invest. 71: 1 836-1 841 , 1 983). 

Due to the diverse sources of the plasma used 
in the preparation of cryoprecipitates, it is difficult 
to test the preparations to ensure that they are free 
of viral contamination. For instance, essentially all 
recipients of cryoprecipitate show a positive test for 
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hepatitis. Recent reports have also indicated that 
some hemophiliacs receiving cryoprecipitate have 
developed acquired immune deficiency syndrome - 
(AIDS). In addition, the purification of large amounts 
of these factors is extremely difficult and expen- 
sive. 

Consequently, there exists a need in the art for 
a method of producing relatively large quantities of 
pure preparations of Factors Vila and Factor IX. 
The present invention fulfills this need through the 
use of recombinant DNA technology, successfully 
eliminating the problem of viral contamination and, 
at the same time, providing a consistent and homo- 
genous source of active Factor Vila to treat Factor 
VII and Factor IX deficient patients and individuals 
with Von Willebrand's disease, as well as providing 
a source of purified Factor IX for use in replace- 
ment therapy. 

Disclosure of the Invention 

Briefly stated, the present invention discloses a 
DNA construct containing a nucleotide sequence 
which codes at least partially for Factor VII. The 
nucleotide sequence comprises a first nucleotide 
sequence encoding a calcium binding domain 
joined to a second nucleotide sequence positioned 
downstream of the first sequence. The second 
nucleotide sequence encodes a catalytic domain 
for the serine protease activity of Factor Vila. The 
joined sequences code for a protein which upon 
activiation has substantially the same biological ac- 
tivity for blood coagulation as Factor Vila. The first 
nucleotide sequence may be substantially that of a 
gene encoding Factor VII, Factor IX, Factor X, 
Protein C, prothrombin, or Protein S. Further, the 
first nucleotide sequence may also encode a leader 
peptide corresponding to the respective gene. 

In particular, the first nucleotide sequence may 
be derived from a genomic clone or cDNA clone of 
Factor VII, and may encode the leader peptide and 
amino-terminal portion of Factor VII. The first 
nucleotide sequence may also include a double- 
stranded oligonucleotide. A particularly preferred 
first nucleotide sequence is that encoding the lead- 
er peptide and amino-terminal portion of Factor IX. 

In addition, the present invention discloses re- 
combinant plasmids capable of integration in mam- 
malian host cell DNA. One of the plasmids includes 
a promoter followed downstream by a set of RNA 
splice sites, the RNA splice sites being followed 
downstream by a nucleotide sequence which 
codes at least partially for Factor VII. The 
nucleotide sequence comprises a first nucleotide 
sequence which encodes a calcium binding domain 
joined to a second nucleotide sequence positioned 



downstream of the first sequence. The secor., 
nucleotide sequence encodes a catalytic domain 
for the serine protease activity of Factor Vila. The 
joined sequences code for a protein which upon 

5 activation has substantially the same biological ac- 
tivity for blood coagulation as Factor Vila. The 
nucleotide sequence is then followed downstream 
by a polyadenylation signal. 
^. Similar to the recombinant plasmid noted 

io above, the present invention also discloses a sec- 
ond plasmid which includes a promoter followed 
downstream by a set of RNA splice sites, the RNA 
splice sites being followed downstrream by a 
nucleotide sequence which codes at least partially 

T5 for Factor IX. The nucleotide sequence comprises 
a first nucleotide sequence which encodes a cal- 
cium binding domain joined to a second nucleotide 
sequence positioned downstream of the first se- 
quence. The second nucleotide sequence encodes 

20 a catalytic domain for the serine protease activity 
of Factor IX. The joined sequences code for a 
protein having substantially the same biological ac- 
tivity for blood coagulation as Factor IX. The 
nucleotide sequence is then followed downstream 

25 by a polyadenylation signal. 

A third aspect of the invention discloses mam- 
malian cells stably transfected to produce a protein 
having susbtantially the same biological activity, 
upon activation, as Factor Vila. The cells are trans- 

30 fected with a DNA construct containing a 
nucleotide sequence which at least partially codes 
for Factor VII. The nucleotide sequence comprises 
a first nucleotide sequence which encodes a cal- 
cium binding domain joined to a second nucleotide 

36 sequence positioned downstream of the first se- 
quence. The second nucleotide sequence encodes 
a catalytic domain for the serine protease activity 
of Factor Vila. The joined sequences code for a 
protein which, upon activation, has substantially tha 

40 same biological activity for blood coagulation as 
Factor Vila. 

An additional aspect of the invention discloses 
mammalian cells stably transfected to produce a 
protein having substantially the same biological ac- 

45 tivity as Factor IX. The cells are transfected with a 
DNA construct containing a nucleotide sequence 
which codes at least partially for Factor IX. The 
nucleotide sequence comprises a first nucleotide 
sequence which encodes a calcium binding domain 

so joined to a second nucleotide sequence positioned 
downstream of the first sequence. The second 
nucleotide sequence encodes a catalytic domain 
for the serine protease activity of Factor IX. The 
joined sequences code for a protein having sub- 

55 stantially the same biological activity for blood co- 
agulation as Factor IX. 
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The present invention further provides for a 
method of producing a protein having biological 
activity for blood coagulation mediated by Factor 
Vila through establishing a mammalian host cell 
which contains a DNA construct containing a 5 
nucleotide sequence which codes at least partially 
for Factor VII. The nucleotide sequence comprises 
a first nucleotide sequence which encodes a cal- 
cium binding domain joined to a second nucleotide 
sequence positioned downstream of the first se- io 
quence. The second sequence encodes a catalytic 
domain for the serine protease activity of Factor 
Vila. The joined sequences code for a protein 
which, upon activation, has substantially the same 
biological activity for blood coagulation as Factor 75 
Vila. Subsequently, the mammalian host is grown 
in an appropriate medium and the protein product 
encoded by the DNA construct and produced by 
the mammalian host cell is isolated. The protein 
product is then activated to generate Factor Vila. 20 

Still a further aspect of the present invention 
discloses a method of producing a protein having 
biological activity for blood coagulation mediated 
by Factor IX. The method comprises establishing a 
mammalian host cell which contains a DNA con- 25 
struct containing a nucleotide sequence which 
codes at least partially for Factor IX. The nucleo 
tide sequence comprises a first nucleotide se- 
quence which encodes a calcium binding domain 
joined to a second nucleotide sequence positioned 30 
downstream of the first sequence. The second 
nucleotide sequence encodes a catalytic domain 
for the serine protease activity of Factor IX. The 
joined sequences code for a protein having sub- 
stantially the same biological activity for blood co- 35 
aguiation as Factor IX. The mammalian host cell is 
subsequently grown in an appropriate medium and 
the protein product encoded by the mammalian 
host cell is isolated. Protein products produced by 
the methods noted above are also disclosed. 40 

Yet another aspect of the present invention 
discloses a DNA construct comprising a DNA se- 
quence encoding Factor VII. In a preferred embodi- 
ment, the DNA sequence comprises the cDNA 
sequence of Figure 1b from bp 36 to bp 1433. In 45 
another preferred embodiment, the DNA sequence 
comprises the cDNA sequence of Figure 1b from 
bp 36 to bp 99, followed downstream by the se- 
quence from bp 166 to bp 1433. Recombinant 
plasmids capable of integration in mammalian host 50 
cell DNA comprising the DNA sequences de- 
scribed immediately above are also disclosed. 

Mammalian cells stably transfected with a re- 
combinant plasmid comprising a DNA sequence 
encoding Factor VII are also disclosed. In preferred 55 
embodiments, the DNA sequence comprises the 



cDNA sequence of Figure 1b from bp 36 to 
bp1433, or the cDNA sequence of Figure 1b, from 
bp 36 to bp 99, followed downstream by the se- 
quence from bp 166 to bp 1433. 

A method for producing a protein having bio- 
logical activity for blood coagulation mediated by 
Factor Vila through establishing a mammalian host 
celt that contains a DNA construct as described 
above is also disclosed. The mammalian 'host cell 
is subsequently grown in an appropriate medium, 
and the protein product encoded by the DNA con- 
struct is isolated. The protein product is then ac- 
tivated to generate Factor Vila. 

Other aspects of the invention will become 
evident upon reference to the following detailed 
description and attached drawings. 

Brief Description of the Drawings 



Figure 1a illustrates the partial Factor VII 
cDNA sequence produced by joining por- 
tions of cDNA clones XVII2115 and XVII1923. 

Figure 1b illustrates the Factor VII cDNA 
sequence of XVII2463. Arrows indicate the 
extent of the deletion, in the sequence of 
XVII565. Numbers above the sequence des- 
ignate amino acids. Numbers below des- 
ignate nucleotides. 

Figure 2a illustrates the amino acid se- 
quences of the amino terminal regions of 
several clotting factors. 

Figure 2b illustrates a comparison of the 
amino acid sequence of Factor VII obtained 
from protein sequencing with that encoded 
by the cDNA. 

Figure 3 illustrates the joining of Factor IX 
leader sequences to a sequence encoding a 
consensus calcium binding domain. 

Figure 4 illustrates the joining of the Factor 
IX-consensus sequence hybrids to a partial 
Factor VII cDNA to produce an in-frame cod- 
ing sequence. 

Figure 5 illustrates the construction of a plas- 
mid containing a coding sequence for a Fac- 
tor IX/Factor VII fusion protein. 

Figure 6 illustrates the expression vector 
FlX/VII/pD2. Symbols used are Ad2 MLP, the 
major late promoter from adenovirus 2; L1-3, 
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the adenovirus 2 tripartite leader sequence; 
5'ss, 5' splice site' 3'ss, 3* splice site' and 
pA, the late polyadenylation signal from 
SV40. 

Figure 7 illustrates the nucleotide sequence 
of a Factor IX/Factor VII cDNA fusion. 

Figure 8 illustrates expression vector 
pM7135. Symbols used are E. the SV40 
enhancer; ori, the 0-1 map units Ad 5; pA, 
the early polyadenylation signal from SV40; 
A, the deletion region of the pBR322 
"poison" sequences; and other symbols as 
described for Figure 6. 

Figure 9 illustrates the subcloning of the 
2463bp Factor VII cDNA. 

Figure 10 illustrates the subcloning of the 
565bp Factor VII cDNA. 

Figure 1 1 illustrates the joining of the 5' end 
of pVII565 and the 3' portion of pVII2463in 
PUC18 to generate pVII2397. 

Figure 12 illustrates the construction of the 
expression plasmids FVII(2463)/pDX and 
FVII(565 + 2463)/pDX. pA denotes the 
polyadenylation signal from SV40 in early or 
late orientation, as described in Example 9. 
Other symbols are as described for Figure 8. 

Best Mode for Carrying Out the Invention 

Prior to setting forth the invention, it may be 
helpful to an understanding thereof to set forth 
definitions of certain terms to be used hereinafter. 

Complementary DNA or cDNA : A DNA mol- 
ecule or sequence which has been enzymatically 
synthesized from the sequences present in a 
mRNA template. 

DNA Construct : A DNA molecule, or a clone of 
such a molecule, either single-or double-stranded, 
which may be isolated in partial form from a natu- 
rally occurring gene or which has. been modified to 
contain segments of DNA which are combined and 
juxtaposed in a manner which would not otherwise 
exist in nature. 

Plasmid or Vector : A DNA construct containing 
genetic information which may provide for its repli- 
cation when inserted into a host cell. A plasmid 
generally contains at least one gene sequence to 



be expressed in the host cell, as well as sequenc. 
which facilitate such gene expression, including 
promoters and transcription initiation sites. It may 
be a linear or closed circular molecule. 

5 Joined : DNA sequences are said to be joined 

when the 5* and 3' ends of one sequence* are 
attached, by phosphodiester bonds, to the 3* and 5' 
ends, respectively, of an adjacent sequence. Join- 
ing may be achieved by such methods as ligation 

10 of blunt or cohesive termini, by synthesis of joined 
sequences through cDNA cloning, or by removal of 
intervening sequences through a process of di- 
rected mutagenesis. 

Leader Peptide : An amino acid sequence 

75 which occurs at the amino terminus of some pro- 
teins and is generally cleaved from the protein 
during subsequent processing and secretion. Lead- 
er peptides comprise sequences directing the pro- 
tein into the secretion pathway of the cell. As used 

20 herein, the term "leader peptide* may also mean a 
portion of the naturally occurring leader peptide. 

Domain : A three-dimensional, self-assembling 
array of specific amino acids in a protein molecule 
which contains all or part of the structural elements 

25 necessary for some biological activity of that pro- 
tein. 

Biological Activity : A function or set of func- 
tions performed by a molecule in a biological con- 
text (i.e. . in an organism or an in vitro facsimile). 

30 Biological activities of proteins may be divided into 
catalytic and effector activities. Catalytic activities 
of clotting factors generally involve the activation of 
other factors through the specific cleavage of pre- 
cursors. Effector activities include specific binding 

35 of the biologically active molecule to calcium or 
other small molecules, to macromolecules such as 
proteins, or to cells. Effector activity frequently 
augments, or is essential to, catalytic activity under 
physiological conditions. Catalytic and effector ac- 

40 tivities may, in some cases, reside within the same 
domain of a protein. 

For Factor Vila, biological activity is character- 
ized by the mediation of blood coagulation through 
the extrinsic pathway. Factor Vila activates Factor 

45 X to Factor Xa, which in turn converts prothrombin 
to thrombin, thereby initiating the formation of a 
fibrin clot. Because the activation of Factor X is 
common to both the extrinsic and intrinsic path- 
ways of blood coagulation, Factor Vila may be 

so used to treat individuals severely deficient in the 
activities of Factor IX, Factor VIII or Von Willebrand 
Factor. 

The biological activity of Factor IX is character- 
ized by the mediation of blood coagulation through 
55 the intrinsic pathway. Factor IX is activated to Fac- 
tor IXa by Factor Xia. Factor IXa then activates 
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Factor X to Factor Xa in the presence of Factor 
Villa, phospholipid, and calcium ions. Factor Xa 
then acts in the conversion of prothrombin to 
thrombin, initiating the formation of a fibrin clot. 

As noted above, the isolation of Factor VII from 5 
human plasma is a time-consuming and expensive 
process since the factor is a rare protein present 
only at a concentration of approximately 300 micro- 
grams per liter of blood. In addition, it is difficult to 
separate from prothrombin. Factor IX and Factor X ro 
and is susceptible to proteolytic attack during pu- 
rification (Kisiel and McMullen, ibid). Although 
single-chain human Factor VII has been purified to 
homogeneity (Kisiel and McMullen, ibid), the pub- 
lished purification methods are generally limited by is 
low yield and/or contamination by other coagulation 
factors. 

Factors VII and IX are produced in the liver and 
require vitamin K for their biosynthesis. Vitamin K 
is necessary for the formation of specific gamma- 20 
carboxyglutamic acid residues in the factors. These 
unusual amino acid residues, which are formed by 
a post-translational modification, bind to calcium 
ions and are responsible for the interaction of the 
protein with phospholipid vesicles. In addition, Fac- 25 
tors VII and IX -each contain one 0-hydroxyaspartic 
acid residue which is also formed after the proteins 
have been translated. However, the role of this 
amino acid residue is not known. 

Given the fact that the activities of Factor VII 30 
and IX are dependent upon post-translational modi- 
fications involving the gamma carboxylation of spe^ 

cific glutamic acid residues, and may also be de- 
pendent upon the hydroxylation of a specific aspar- 
tic acid- residue, it is unlikely that an active product 35 
could be produced through -the cloning and expres- 
sion of Factors VII and IX in a microorganism. 

Accordingly, the present invention provides a 
method of producing a protein having biological 
activity for blood coagulation mediated by Factor ao 
Vila using stably transfected mammalian cells. In 
addition, the present invention also provides a 
method of producing a protein having biological 
activity for blood coagulation mediated by Factor 
IX. 45 

As noted above, Factors VII and IX require 
vitamin K for their biosynthesis. In addition, the 
plasma proteins prothrombin, Factor X, Protein C, 
and Protein S also require vitamin K for their bio- 
synthesis. The amino-terminal portions of these so 
proteins, which contain gamma-carboxyglutamic 
acid residues, are homologous in both amino acid 
sequence and in biological function (Figure 2a). 
Further, the carboxy-terminal portions of Factor VII, 
prothrombin, Factor IX, Factor X, and Protein C 55 
determine their specific serine protease functions. 



Factor VII is a trace plasma protein, and the 
mRNA encoding Factor VII is believed to be rare. 
Consequently, purification of Factor VII from plas- 
ma in sufficient quantities to permit extensive se- 
quence analysis and characterization remains dif- 
ficult. Degradation of Factor VII during purification, 
even in the presence of protease inhibitors, was 
noted by Kisiel and McMullen (ibid). Due to these 
difficulties, Factor VII has been poorly character- 
ized, compared to other more abundant compo- 
nents of the blood coagulation system. Indeed, the 
work of Kisiel and McMullen (ibid) yielded se- 
quence information for only 10 residues of each 
chain of Factor VII, and in each sequence the 
identification of two residues was tentative. Partial 
amino acid sequence data for Bovine Factor VII 
have also been published (DiScipio et al.. ibid). 

The presumed rarity of Factor VII mRNA has 
contributed to the lack of knowledge of the Factor 
VII gene. The success of conventional cDNA clon- 
ing techniques is dependent on a sufficient quantity 
of mRNA for use as a template. Premature termina- 
tion of reverse transcription results in the produc- 
tion of cDNA clones lacking the 5' end and this 
condition is exacerbated by low mRNA levels. Sev- 
eral strategies for cDNA cloning of low abundance 
message have been developed (Maniatis et aL, 
Molecular Cloning: A Laboratory Manual . Cold 
Spring Harbor Laboratory, 1982), but a lack of 
knowledge of the amino acid sequence of the prod- 
uct of interest makes it impossible to predict the 
DNA sequence and to design appropriate 
oligonucleotide probes. While it may be relatively 
straightfoward to obtain a partial cDNA clone of a 
gene encoding a rare protein by using these ad- 
vanced strategies, full-length cDNA clones of 
genes encoding rare proteins such as Factor VII 
remain exceedingly difficult to obtain. 

In comparison to Factor VII, Factor IX is a 
relatively abundant protein and the sequence of a 
cDNA clone of the human Factor IX gene is known 
(Kurachi and Davie, Proc. Natl. Acad. Sci. USA 79: 
6461-6464, 1982; and Anson et al., EMBO J.3 : 
1053-1060, 1984). The structure of the Factor IX 
gene has been characterized and the amino acid 
sequence of the protein has been determined on 
the basis of the known nucleotide sequence. Some 
protein sequence data have also been published 
for human and bovine Factor IX and the sequences 
analyzed (DiScipio et al.. ibid). The amino terminal 
portion of the protein contains 12 glutamic acid 
residues that are converted to -y-carboxyglutamic 
acid (Gla) residues in the mature protein. The 
cleavage sites involved in the activation of Factor 
IX have also been identified (Kurachi and Davie, 
ibid). A sequence at the 5* end of the Factor IX 
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cDNA clone codes for a signal peptide which is 
typical of those found in most secreted proteins - 
(Kurachi and Davie, ibid). The expression of the 
Factor IX gene through recombinant DNA methods 
has not been previously reported. 5 

Because of the difficulty in obtaining a full- 
length cDNA clone of the Factor VII gene, three 
novel approaches were adopted to supply the 5' 
end of the coding sequence, including the region 
encoding the leader peptide. According to the first 10 
-method, a partial cDNA clone for Factor VII is 
joined to a fragment encoding the leader peptide 
^and 5' portion of Factor IX. This approach is based 
on the observation that the amino-terminal portions 
of the two molecules are responsible for the cal- 75 
cium binding activities of the respective proteins 
and the discovery that the calcium binding activity 
■of Factor IX can substitute for that of Factor VII. 
The resultant polypeptide retains the biological ac- 
tivity of authentic Factor VII because the specific 20 
serine protease activities of the coagulation factors 
reside in the carboxy-terminal regions of the mol- 
ecules. The second approach combines the partial 
cDNA clone with a DNA sequence encoding the 
leader and amino-terminal regions of Factor VII. 25 
The partial cDNA and amino acid sequences of 
Factor VII disclosed herein enable the screening of 
a genomic DNA library or cDNA library for clones 
comprising the 5* portion of the Factor VII gene. 
The third approach involves joining the partial 30 
cDNA clone to hybrid coding sequences compris- 
ing a cDNA fragment encoding the leader peptide 
of Factor IX and a synthetic gene segment encod- 
ing a consensus calcium binding domain or a pre- 
dicted amino terminal sequence for Factor VII. The 35 
coding sequence for the amino terminus of Factor 
VII was established through previously unpublished 
amino acid sequence data disclosed herein. The 
consensus sequence was derived from the factor 
VII data and published sequence data for other 40 
vitamin K-dependent plasma proteins. 

Consistent with the approach described above 
for screening for clones comprising the 5 f portion 
of the Factor VII gene, the inventors have been 
successful in obtaining a full-length, correct cDNA 45 
that is suitable for expression. 

Among the cDNA clones that were generated, 
a clone designated " XVH2463" contained the larg- 
est Factor VII cDNA insert. It was found to contain 
the entire coding sequence for Factor VII. This so 
clone included a 35 nucleotide 5* untranslated re- 
gion, 180 nucleotides coding for a 60 amino acid 
leader, J 21 8 nucleotides coding for the 406 amino 
acid mature protein, a stop codon, 1026 
nucleotides of 3* untranslated sequence, and a 20 55 
base poiy(A) tail (beginning at position 2463). This 
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cDNA has now been sequenced in its entirety c. 
both strands. A comparison of it with two cDNA 
inserts isolated earlier from clones XVII2115 and 
XVIII923, revealed that XVII2463 contains, on a sin- 
gle EcoRI fragment, a Factor VII cDNA coding for 
Factor VII leader and mature protein sequences. 

A second clone, XVII565, was isolated that con- 
tained a cDNA insert that was identical to the cDNA 
of clone XVII2463 from nucleotide 9 to nucleotide 
638, except that it lacked nucleotides 100 to 165 - 
(Figure 1b). In comparing the cDNAs to Factor VII 
genomic DNA, the absent sequences correspond 
precisely to one exon-like region. Therefore, two 
Factor VII cDNAs have been obtained which ap- 
pear to reflect alternative mRNA splicing events. 

The leader encoded by XVII2463 is exception- 
ally long (60 amino acids) and has a very different 
hydrophobicity profile when compared with Factor 
IX, protein C and prothrombin. This leader contains 
two mets, at positions -60 and -26. Initiation most 
likely beigns at the first met, since a hydrophobic 
region, typical of signal peptides, follows the met at 
position -60, but not the met at -26. It is interesting 
that the absent sequence in XVU565, which cor- 
responds precisely to an exon-like region in the 
genomic clone, results in a 38 amino acid leader 
with a hydrophobicity pattern more analogous to 
Factor IX, protein C, and prothrombin. 

Since it was not clear then which, if either, of 
the leaders described above was authentic, an ad- 
ditional approach was initiated in an effort to ana= 

lyze the 5* end sequence. Briefly, this approach 
included the construction and screening of a hu- 
man genomic DNA library, and the identification of 
genomic clones comprising Factor VII gene se- 
quences. The 5 1 portion of the genomic sequence 
was subseqently joined to the cDNA to construct a 
full-length clone. 

In an additional construct, a 5* Factor VII cDNA 
fragment of XVII565 containing all of the leader and 
29 amino acids of the mature coding sequence was 
ligated to a fragment of the cDNA of XVI12463 - 
(containing the remainder of the mature protein and 
3*-untranslated sequences). This "565-2463" se- 
quence encodes a full-length Factor VII cDNA se- 
quence as a single EcoRI fragment. 

The DNA sequences described above are then 
inserted into a suitable expression vector which is 
in turn used to transfect a mammalian cell line. 
Expression vectors for use in carrying out the 
present invention will comprise a promoter capable 
of directing the transcription of a foreign gene in a 
transfected mammalian cell. Viral promoters are 
preferred due to their efficiency in directing tran- 
scription. A particulary preferred such promoter is 
the major late promoter from adenovirus 2. Such 
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expression vectors will also contain a set of RNA 
splice sites located downstream from the promoter 
and upstream from the insertion site for a gene 
encoding a protein having biological activity for 
blood coagulation. Preferred RNA splice site se- 
quences may be obtained from adenovirus and/or 
immunoglobulin genes. Also contained in the ex- 
pression vectors is a polyadenylation signal, lo- 
cated downstream of the insertion site. Viral 
polyadenylation signals are preferred, such as the 
early or late polyadenylation signals from SV40 or 
the polyadenylation signal from the adenovirus 5: 
Elb region. In a particularly preferred embodiment, 
the expression vector also comprises a viral leader 
sequence, such as the adenovirus 2 tripartite lead- 
er, located between the promoter and the RNA 
splice sites. Preferred vectors may also include 
enhancer sequences, such as the SV40 enhancer. 

Cloned DNA sequences may then be intro- 
duced into cultured mammalain cells by calcium 
phosphate mediated transfection. (Wigler et al., 
Cell 14: 725. 1978; Corsaro and Pearson, Somatic 
Cell Genetics 7: 603, 1981; Graham and Van der 
Eb, Virology 52: 456, 1973.) A precipitate is formed 
of the DNA and calcium phosphate and this 
precipitate is applied to the cells. A portion of the 
cells take up the DNA and maintain it inside the 
cell for several days. A small fraction of the cells - 
m (typically 10 **) stably integrate the DNA into the 
genome. In order to identify these stable integrants, 
a gene that confers a selectable phenotype (a 
selectable marker) is generally introduced along 
with the gene of interest. Preferred selectable 
markers include genes that confer resistance to 
drugs, such as G-418 and methotrexate. Selectable 
markers may be introduced into the cell on a 
separate plasmid at the same time as the gene of 
interest or they may be introduced on the same 
plasmid. A preferred selectable marker is the gene 
for resistance to the drug G-418, which is carried 
on the plasmid pKOneo (Southern and Berg, J. 
Mol. ApdI. Genet* I: 327-341, 1982). It may also be 
advantageous to add additional DNA, known as 
"carrier DNA," to the mixture which is introduced 
into the cells. After the cells have taken up the 
DNA, they are allowed to grow for a period of time,, 
typically 1-2 days, to begin expressing the gene of 
interest. Drug selection is then applied to select for 
the growth of cells which are expressing the selec- 
table marker in a stable fashion. Clones of such 
cells may be screened for expression of the protein 
of interest. 

Factor VII and Factor IX produced by the trans- 
fected cells may be removed from the cell culture 
media by adsorption to barium citrate. Spent me- 
dium is mixed with sodium citrate and barium 
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chloride and the precipitate col lected. The 
precipitated material may then be assayed for the 
presence of the appropriate clotting factor. Further 
purification may be achieved through immunoad- 

5 sorption. It is preferred that the immunoadsorption 
column comprise a high-specificity monoclonal 
antibody. Alternatively, purification of the barium 
citrate precipitated material may be accomplished 
by more conventional biochemical methods or by 

to high-performance liquid chromatography. 

Conversion of single-chain Factor VII to active 
two-chain Factor Vila may be achieved using Fac- 
tor Xlla as described by Hedner and Kisiel (i Clin. 
Invest. 7V. 1836-1841, 1983), or with other prot- 

75 eases having trypsin-like specificity (Kisiel and 
Fujikawa, Behrino Inst. Mitt. 7£: 29-42, 1983). 

In summary, the present invention provides a 
method for the production of proteins having the 
activity of vitamin K-dependent blood coagulation 

20 factors using transfected mammalian cells. Gene 
sequences encoding the specific serine protease 
domains of the coagulation factors are ..isolated 
from cDNA libraries. Sequences encoding the lead- 
er peptides and calcium binding domains are iso- 

25 lated from cDNA or genomic libraries or construct- 
ed from synthesized oligonucleotides. The se- 
quences are then joined in an appropriate expres- 
sion vector so as to encode a protein having the 
desired biological activity for blood coagulation. 

30 The resulting vector and a plasmid containing a 
drug resistance marker are co-transfected into ap- 
propriate mammalian tissue culture cells. Transfec- 
ted cells may then be selected by addition of the 
appropriate drug, such as G-418. The protein pro- 

35 ducts are then purified from the cell growth media 
and assayed for biological activity in a blood co- 
agulation assay and for immunological cross-reac- 
tivity using antibodies prepared against authentic 
human clotting factors. 

40 To summarize the examples which follow, Ex- 

ample 1 discloses the cloning of a full-length cDNA 
sequence for Factor VII. Example 2 discloses a 
partial amino acid sequence of human Factor VII, 
including the sequence of approximately 30 amino 

45 acids at the amino terminus. Example 3 discloses 
the construction and screening of a human 
genomic DNA library and the identification of 
genomic clones comprising Factor VII gene se- 
quences. Example 4 discloses the construction of 

so two hybrid gene segments, each comprising a 
cDNA fragment encoding the leader peptide of 
Factor IX and a synthesized double-stranded frag- 
ment encoding a consensus calcium binding do- 
main. The hybrid sequences are then joined to 

55 partial cDNA clones of Factor VII. Using in vitro 
mutagenesis, the consensus sequence was then 
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altered to conform to the protein sequence data for 
Factor VII. Example 5 describes the construction of 
a gene sequence encoding a fusion protein com- 
prising the calcium binding domain of Factor IX 
and the specific serine protease domain of Factor 
VII. Example 6 describes the construction of the 
vector pD2 for use in expressing proteins having 
biological activity for blood coagulation in transfec- 
ted mammalian cells. The gene fusion described in 
Example 5 is expressed using this vector. Example 
7 describes the use of the vector pD2 to express a 
gene for Factor IX in a transfected mammalian cell 
line. Example 8 describes the construction of the 
vector pM7135, which contains DNA sequences 
encoding a primary translation product comprising 
the leader sequence of Factor IX fused to Factor 
VII. This vector may be used to produce a protein 
having the activity of Factor VII in a transfected 
mammalian cell line. Example 9 describes the ex- 
pression of Factor VII using cDNA sequences, and 
the expression of Factor VII from a genomic-cDNA 
hybrid sequence. 

The following examples are offered by way of 
illustration and not by way of limitation. 

EXAMPLES 

Restriction enzymes were obtained from Be- 
thesda Research Laboratories (BRL) and New Eng- 
land Biolabs and were used as directed by the 
manufacturer, unless otherwise noted. 
Oligonucleotides were synthesized on an Applied 
Biosystems Model 380 A DNA synthesizer and 
purified by polyacrylamide gel electrophoresis on 
denaturing gels. E. cqK cells were transformed as 
described by Maniatis et ai. (Molecular Cloning: A 
Laboratory Manual . Cold Spring Harbor Laboratory, 
1982). M13 and pUC cloning vectors and host 
strains were obtained from BRL. Factor VII was 
prepared from human plasma as described by 
Kisiel and McMullen (ibid). 

Example 1 : Cloning of a Partial Factor VII cDNA. 

A. Construction of a human liver cDNA library. 

A cDNA library was prepared from human liver 
mRNA by the method of Chandra et al., Proc. NatL 
Acad. Sci. U.S.A. 80 : 1845-1848, 1983. The cDNA 
preparation was sedimented through an alkaline 
sucrose gradient (Monahan et aL, Biochemistry 15: 
"223-233, 1976) and fractions containing species of 
greater than about 1000 nucleotides were pooled. 
The first strand preparation was made double- 
stranded using reverse transcriptase (Chandra et 
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al., 1 983),* treated with S1 nuclease, and the resit I 
ual staggered ends filled-in using DNA Polymerase 
I (Klenow fragment) in the presence of all four 
deoxyribonucleotide triphosphates (Maniatis et aL 

5 Molecular Cloning: A Laboratory Manual , Cold 
Spring Harbor Laboratory, 1982). The blunt-ended 
cDNA was treated with Eco Rl methylase and ligat- 
ed to phosphorylated Eco Rl linkers using T 4 DNA 
ligase (Maniatis et al., ibid). The ligated DNA prep- 

io aration was exhaustively digested with Eco Rl to 
remove excess linker sequences and double- 
stranded DNAs greater than about 1000 base pairs 
in length were purified by neutral sucrose gradient 
centrifugation (Maniatis et al., ibid). Native \gt11 

75 DNA was ligated into concatemers, digested to 
completion with Eco Rl, and the 5' terminal phos- 
phates were removed by treatment with bacterial 
alkaline phosphatase. The pooled human liver 
cDNA was ligated with the phage DNA, packaged 

20 in vitro (Maniatis et ah, ibid), and used to infect E. 
coli Y1088 (Young and Davis, Science , 222 : 778- 
782, 1983). Approximately 14 x 10 6 primary phage 
* plaques were generated in this library, composed 
of seven libraries of - 2 x 10* plaques each. 

25 Greater than 90% of these were recombinants con- 
taining human DNA inserts, based on their lack of 
0-galactosidase activity and characterization of 20 
random clones by Eco Rl digestion followed by 
agarose gel electrophoresis. The cDNA library, in 

30 the form of phage particles, was purified by cesium 
chloride gradient centrifugation and stored in SM 
buffer (Maniatis et at., ibid). 

B. Screening of the human liver cDNA library for 
35 Factor VII clones. 

The human liver expression cDNA library de- 
scribed above was screened for specific antigen - . 
(Young and Davis, ibid) using an 125 l-labeled mon- 

40 oclonal Factor VII antibody prepared by the method 
of Brown et al. (i Biol.Chem. 225 : 4980-4983, 
1980) using purified Factor VII. Screening of 6 x 
10* phage plaques identified one isolate, desig- 
nated XVII2115, which gave a positive response 

45 with the antibody. 

The phage clone XVII2115 was tested against 
two other anti-Factor VII monoclonal antibodies and 
a rabbit polyclonal antibody to Factor VII. Isolate 
XVII2115 gave a positive response to all these anti- 

so Factor VII antibodies. 

DNA was prepared from a plate lysate 
(Maniatis et al.. pp. 65-66, 1982) of XVII2115. Di- 
gestion of this DNA with Eco Rl liberated an insert 
of 2139 base pairs. This insert was subcloned into 

55 M13 phage vectors (Messing, Meth. in Enzymoloov 
101 : 20-77, 1983; and Norrander et al., Gene 26 : 
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101-106, 1983) for chain termination dideoxy DNA 
sequencing (Sanger et aL, Proc. Natl. Acad. Sci. 
U.S.A. 74: 5463-5476, 1977). This cDNA insert con- 
tains Pst I sites at positions 214, 839, and 1205 - 
(designated Pst la, Pst lb f and Pst (c, respectively, 
in Figure 1a) and a Sma I site located at position 
611. The following M13 templates were sequenced: 

1) full-length (2139 bases) Eco Rla — Eco 
Rib fragment in M13mp18 (designated clone 
F7-1); 

2) Pst la -* Eco Rla 214 base fragment in 
M13mp19 (F7-2); 

3) Pst la — Pst lb 625 base fragment in 
M13mp18 (F7-3); 

4) Pst lb — Pst la 625 base fragment in 
M13mp18 (F7-7); 

5) Sma I — Pst lb 228 base fragment in 
M13mp10 (F7-8); 

6) Pst lb — Pst lc 366 base fragment in 
M13mp18 (F7-9); 

7) Pst lc — Pst lb 366 base fragment in 
M13mp18 (F7-10); 

8) Pst lc —> Eco Rib 930 base fragment in 
M13mp19 (F7-11); and 

9) Eco Rib Eco Rla full-length fragment in 
M13mp18 (F7-1 2) (restriction site designa- 
tions refer to Figure 1a). 

The data confirmed the sequence on both 
strands for 91% of the coding region and 
15% of the 3* non-coding region and yielded 
single-stranded sequence information for the 
remaining 9% of the coding region and 85% 
of the non-coding region. 
Comparison of the amino acid sequence pre- 
dicted from the cDNA sequence with the known 
amino acid sequence data of Kisiel and McMullen - 
(Thrombosis Research 22 : 375, 1981) and the ami- 
no acid sequence shown below (Example 2) re- 
vealed an anomaly which could be explained by 
the absence of three nucleotides in the DNA se- 
quence near position 400. To obtain additional se- 
quence data, XVII2115 was digested with Eco Rl, 
and the Factor VII coding fragment was inserted 
into pUC 13 (Vieira and Messing, Gene 19 : 259- 
268, 1982; and Messing, ibid) which had been 
digested with Eco Rl. The resultant recombinant 
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plasmid, designated pUCVII21l5, was digested 
with Xba I which cut at position 328. The digested 
sample was divided in half: half was labeled with 
o 32 P dCTP and DNA Polymerase I (Klenow frag- 

s ment) (Englund, P.T., J, MgL §6 : 209. 1972); 
the other half was labeled with «y 32 P ATP and 
polynucleotide kinase (Chaconas et al., Biochem. 
Bioohvs. Res. Comm. §§: 962, 1975). The labeled 
plasmids were then recut with Pst I to yield 113 

io and 509 base pair fragments. Both strands of each 
of these were sequenced by the method of Maxam 
and Gilbert ( Meth. in Enzvmoioav 74: 560, 1980). 
The 113 base pair fragment was sequenced in its 
entirety and 210 base pairs of the 509 base pair 

rs fragment were sequenced. These sequences re- 
vealed three additional bases ( one C and two G*s) 
which rendered the DNA sequence data in agree- 
ment with the protein sequence data, indicating 
that the previous anomalous results arose from 

20 compressions on the sequencing gel due to secon- 
dary structure involving G's and C's. The sequence 
of the last 9% of the coding region on both strands 
was also confirmed. 

Further analysis of the sequence of the 

25 pUCVII2115 insert confirmed that a portion of this 
cloned fragment encoded a sequence of 1 1 amino 
acids known to be at the cleavage site of Factor VII 
(Kisiel and McMullen, Thrombosis Research 22 : 
375, 1981). Comparison of this sequence to Factor 

30 IX (Davie et al., ibid) and Factor X (Leytus et al., 
Proc. Natl. Acad. Sci. U.S.A. §1: 3699-3702, 1984) 
amino acid sequences suggested that the clone 
contained the sequence for Factor VII beginning at 
(approximately) nucleotides coding for amino acid 

35 36 of the mature Factor VII protein and continuing 
through approximately 1000 coding and 1100 non- 
coding nucleotides and poly A sequence. In addi- 
tion, it was found that this clone had frameshift 
mutations in the 3' coding portion. 

40 In order to obtain the correct 3* coding region, 

all 14 million clones of the seven Xgtlt cDNA 
libraries were screened by plaque hybridization - 
(Benton and David. Science 196 : 180-181, 1977) 
with nick-translated cDNA of XVII2115 (Maniatis et 

45 al., pp. 109-112, 1982). 

Seven positive isolates were then screened by 
dideoxy sequencing of pUC plasmids into which 
the cDNA inserts had been subcloned (Wallace et 
al., Gene 16 : 21, 1981). The \gt11 clones were 

so digested with Eco Rl and the Factor VII fragments 
were inserted into pUC13 which had been cleaved 
with Eco Rl. All except one of these were found to 
start at a position corresponding to base 212 of the 

55 
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insert in XVII2115; the one exception consisted only 
of 3* non-coding sequence. One of the clones 
starting at base 212 was selected for analysis and 
was designated clone pUCVII1923. 

Because analysis of pUCVII2115 indicated the 5 
presence of frameshift mutations between positions 
657 and 815, pUCVII1923 was first analyzed in this 
region by Maxam-Gilbert sequencing. Plasmid 
pUCVII1923 was digested with Nar I (position 779 
in Figure 1a). The cut DNA was labeled with a 32 P 10 
dCTP using DNA polymerase I (Klenow fragment) 
and subsequently digested with Ava I (which 
cleaves at the same site as Sma I in Figure I) and 
Taq I (site at 1059), yielding a Nar l-Ava i 166 bp 
fragment and a 200 bp Nar l-Taq I fragment. Each 75 
of these was sequenced. A C, missing in 
pUCVH2115, was found at position 697 and another 
C, also missing in pUCVII2115. was found at post- 
ition 798. 

The rest of the sequence of the coding region 20 
of pUCVII1923 was shown to be correct by sequen- 
cing by the dideoxy method on an M13 subclone 
of the entire insert of pUCVII1923. The Lac primer 
ZC87 (Table 1) was used to sequence from posi- 
tion 212 (Figure 1a) to 512; primer ZC218 25 
(CTCTGCCTGCCGAAC) was used to sequence 
from 715 to 1140 and primer ZC217 
(ATGAGAAGCGCACGAAG) was used for sequen- 
cing from 720 to 350. Since the pUCVH2115 insert 
is correct from position 13 (positions 1-12 include 30 
an artificial linker) to 695, and pUCVII1923 is cor- 
rect from position 212 to the end, the two were 
spliced together to yield a molecule correct from 
position 13 (Figure 1a) to the end. A convenient 
point utilized for this splice is the Xba I site at 35 
position 328. The sequence of the spliced cor- 
rected molecule is shown in Figure 1a. 

Because a full-length Factor VII clone was dif- 
ficult to obtain by cDNA cloning, three strategies 
were adopted to provide the missing coding se- 40 
quence and the necessary upstream processing 
and signal sequences. The first strategy was to 
obtain the needed sequence from a human 
genomic DNA library or through additional screen- 
ing of cDNA libraries. The second approach was to 45 
synthesize the necessary 5' coding sequence, 
based on the amino acid sequence data for Factor 
VII (Example 2) and the published sequences of 
the genes encoding vitamin K-dependent clotting 
factors (Kurachi and Davie, ibid; and Davie et at., so 
ibid), and join this to a portion of the prepro se- 
quence of Factor IX. The third strategy relies on 
the functional homology of the amino terminal re- 
gions of Factor VII and Factor IX. A sequence was 
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constructed which comprised the coding regioi 
for the leader and amino-terminal portion of Factor 
IX. This was then fused in the proper orientation to 
the partial Factor VII cDNA. 

In order to obtain DNA sequences that com- 
prise the entire DNA sequence of Factor VII, an 
attempt was made to isolate the remaining 5* DNA 
sequence. This was accomplished through the utili- 
zation of the 5* terminal 0.3 kb EcoRI-Xbal frag- 
ment from the cDNA insert of WII2115 to screen a 
cDNA library comprising 2 x 10 6 phage. The library 
was constructed using poly(A) mRNA from HepG2 
cells following an adaptation of the method of 
Gubler and Hoffman (Gene 25: 263-269, 1983). The 
RNA was reverse transcribed to generate first 
strand cDNA, followed by second strand synthesis 
using DNA polymerase I and RNase H. Following 
EcoRI methylatioh and passage over a Sepharose 
6B column, the DNA termini were blunted with T* 
DNA polymerase. EcoRI linkers were added and 
excess linkers were removed by digestion with 
EcoRI and chromatography on Sepharose CL 2B. 
The DNA in the void volume was collected and 
ligated to XgtH which had been digested with 
EcoRI and treated with calf intestinal phosphata.se. 
The DNA was packaged and infected into E. coli 
Y1088. Several positives were detected, and the 
EcoRI fragments were subsequently subcloned into 
M13 phage vectors for dideoxy sequencing using 
either the M13 universal primer or Factor VII spe- 
cific oligonucleotides. 

From these, three new cDNA clones of Factor 
VII were obtained, and their sequences completely 
determined. The largest of these cDNAs, from a 
clone designated WII2463, was found to contain 
the entire coding sequence for Factor VII. This 
clone included a 35 nucleotide 5' untranslated re- 
gion, 180 nucleotides coding for a 60 amino acid 
leader, 1218 nucleotides coding for the 406 amino 
acid mature protein, a stop codon, 1026 
nucleotides of 3* untranslated sequence, and a 20 
base poly(A) tail (beginning at position 2463). This 
cDNA has now been sequenced in its entirety on 
both strands. A comparison of it with two cDNAs 
isolated earlier from clones XVII2115 and XVII1923 
revealed that clone XVII2463 contains an additional 
321 nucleotides upstream of the insert in XVII21 1 5 
and 519 nucleotides upstream of the insert in 
XVII1923. The overlapping Factor VII sequences of 
XVII2463 and these two previous cDNAs agree, 
except that the cDNA of XVII2463 does not contain 
single base deletions at positions 1005 and 1106, 
which were detected in the cDNA of XVII2115. 
Thus, XVII2463 contains, on a single EcoRI frag- 
ment, a Factor VII cDNA coding for Factor VII 
leader and mature protein sequences. 
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An additional cDNA, XVII565, was isolated and 
found to contain 5' terminal Factor VII sequences, 
but was truncated within the coding sequences. Its 
5* end maps at nucleotide 9 (Figure 1 b). 

When compared with full-length XVII2463. 
XVII565 was found to lack a sequence correspond- 
ing to one exon-like region within the leader se- 
quence. Bases 100-165 are absent from XVII565 - 
(Figure 1 b). The absent sequences correspond pre- 
cisely to one exon-like region by comparison with 
genomic sequence data (as described in Example 
III). Thus, the XVII565 structure may be a con- 
sequence of alternative splicing events in the lead- 
er sequence. 

The leader encoded by XVI 12 463 is exception- 
ally long (60 amino acids) and has a very different 
hydrophobicity profile when compared with Factor 
IX, protein C and prothrombin. This leader contains 
two Mets, at positions -60 and -26. Initiation most 
likely begins at the first Met, since a hydrophobic 
region, typical of signal peptides, follows the Met at 
position -60, but not the Met at -26. It is interesting 
that the absent sequences in XVII565 , which cor- 
responds precisely to an exon-like region in the 
genomic clone, results in a 38 amino acid leader 
with a hydrophobicity pattern more analogous to 
the above proteins. 

Example 2: Animo Acid Sequence of Human Factor 
VII. 

The elucidation of the amino acid sequence of 
human Factor VII was desired in order to confirm 
the identity of putative cDNA clones, substantiate 
the sequence of Factor VII cDNA, provide informa- 
tion allowing for the synthesis of specific 
oligonucleotide probes to screen cDNA and 
genomic libraries for clones containing the 5' se- 
quence, and to construct a synthetic fragment en- 
coding the ami no-terminal portion of Factor VII. 
Although limited amino acid sequence was pro- 
vided by Kisiel and McMulien (ibid), more informa- 
tion was needed. 

Purified human Factor Vila (Kisiel and McMul- 
ien, ibid) was reduced and carboxymethylated by 
the method of Crestfield et al M i Biol. Chem. 238 : 
622, 1963. The light and heavy polypeptide chains 
of carboxymethylated Factor Vila were separated 
by high-performance liquid chromatography - 
(HPLC) on a Micro Pak C18 reverse phase column 
(Varian Corp.) by generating a gradient of 0.1% 
TFA in distilled water (A) and 0.1% TFA in acetoni- 
trile (B) from 0-40% B in 5 minutes, 40-80% B in 
25 minutes and 80-100% B in 5 minutes. Approxi- 
mately 300 picomoles of each peptide chain were 
analyzed by automated Edman degradation using a 



Gas-Phase Protein Sequencer (Applied 
Biosystems, Inc.). Eighteen and 29 residues were 
identi fied at the amino-termini of the heavy and 
light polypeptide chains, respectively. The amino- 

5 terminal sequence of the heavy chain of Factor Vila 
was consistent with that encoded by cONA clone 
pUCVII21 15 (Figure 2b). Amino acid residues are 
designated within Figures 2a and 2b by single 
letter code as follows: A, alanine; C, cysteine; D, . 

10 aspartic acid; E, glutamic acid; F, phenylalanine; G, 
glycine; H, histidine; I,, isoleucine; K, lysine; L, 
leucine; M, methionine; N, asparagine; P, proline; 
Q, glutamine; R, arginine; S, serine; T, threonine; V, 
valine; W, tryptophan; Y, tyrosine; X indicates an 

75 unknown residue and " indicates that the Gia resi- 
dues (y) were assigned by homology to the struc- 
tures of other known clotting factors and by the 
absence of any other phenylthiohydantoin-amino 
acid at those positions. The gaps (-) are placed to 

20 provide the best alignment among the sequences. 
In addition, the information indicated that the amino 
acids at positions five and nine were lysines and 
not threonine and arginine, respectively, as pre- 
viously reported (Kisiel and McMulien, ibid). The 

25 sequence analyses of the tight chain of Factor Vila, 
which originates from the amino-terminal region of 
Factor VII, fell short by approximately 6 residues to 
overlap with the structure encoded by the 5* end of 
cDNA clone pUCVII2115. 

30 To obtain additional sequence data, two 

nanomoles of the carboxymethylated light chain 
were digested for 12 hours by bovine chymotrypsin 
(1:100 w/w, enzyme: substrate) in 0.1 M ammo- 
nium bicarbonate, pH 7.8, at 37°C. The generated 

35 fragments were purified by HPLC on a Micro Pak 
C18 reverse phase column using the above sol- 
vents in a gradient of 0-30% B in 5 minutes, 30- 
60% B in 25 minutes and 60-80% B in 10 minutes. 
Peptides were identified by their U.V. absorption at 

40 220 and 280 nm. Lyophiiized peptides - 
(approximately 1 nanomole each) were analyzed by 
Edman degradation. The results (Figure 2b) con- 
firmed much of the cDNA sequence in the cor- 
responding region of clone pUCVII2115. In total, 

45 113 of 152 residues (75%) of the light peptide 
chain of Factor Vila were identified. This sequence 
is identical to that encoded by the known cDNA 
structure. Indirect evidence indicates Asn 145 is a 
site of carbohydrate attachment. 

50 
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Example 3: Cloning of the genomic Factor VII se- 
quence. 

As one approach to providing the 5* end se- 
quence lacking from the cDNA, a lambda phage 
library containing human fetal liver DNA (Lawn et 
al-, £sil 15: 1157-1174) was screened with nick 
translated Factor VII cDNA. A portion of the 
genomic library was plated on E. ££N LE392 (ATCC 
33572) to produce a total of 7.2 x 10* plaques 
(Maniatis et al., ibid, pp. 320-321). The phage 
plaques were adsorbed from the plates onto ni- 
trocellulose and hybridized with the 32 P-labeled 
cDNA according to the procedure of Benton and 
Davis (Science 196 : 180, 1977). Eight clones were 
obtained and plaque purified. 

Using a DNA fragment (Eco Rla-Xba I, Figure 
1) from the 5' end of the Factor VII cDNA - 
(XVII2115) and standard techniques (Maniatis et al., 
ibid) those genomic clones containing 5* end se- 
quences were identified. These phage were des- 
ignated 7m 1, 7m2 and 7m3. DNA was prepared 
from these recombinant phage and preliminary re- 
striction endonuclease maps derived. Phage 7ml t 
which gave the strongest hybridization signal, was 



used to generate a more extensive restriction nru, 
and to place the Eco Rl-Xba I cDNA sequences on 
this map by Southern blotting (Southern, J. Mol. 
Biol.98 : 503, 1975). 

5 In order to determine if phage 7m 1 contained 

the DNA sequences encoding the amino terminal 
amino acids of the Factor VII protein. Southern 
blots of phage DNA restriction digests were 
hybridized with mixtures of oligonucleotides whose 

to sequences were deduced from the Factor VII ami- 
no terminal amino acid sequence. Oligonucleotides 
ZC188, 2C360, and ZC401 (Table 1) were radioac- 
tively labeled with T, polynucleotide kinase and 
hybridized to the phage DNA blots at a few de- 

75 grees centigrade below their Tm (Wallace, R. B., et 
al., Nuc. Acids Res . 6: 3543-3557, 1979), The re- 
sults of this analysis indicated that a 3.7 kb Sst I 
fragment of 7m1 contained sequences hybridizing 
to these oligonucleotides. This Sst I fragment was 

20 subcloned into M13 for DNA sequence analysis. 
Results obtained using ZC360 as sequencing prim- 
er identified a region approximately 60 nucleotides 
in length, which corresponded to the amino-termi- 
nal protein sequence data. 
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TABLE 1 

Oligonucleotide Sequence 

2C87 TCC CAG TCA CGA CGT 

T G A 

ZC188 GCC GGG CTCA CTC CTC CA GAA GGC GTTGG 

C A G 



ZC212 


GAC 
TCA 


CTG 
TGG 


CAG 


GAT 


CCA 


TGC 


AGC 


GCG 


TGA 


ACA 


ZC213 


GAG 
GCG 


GCC 
CTG 


TGG 


TGA 


TTC 


TGC 


CAT 


GAT 


CAT 


GTT 


ZC217 


ATG 


AGA 


AGC 


GCA 


CGA 


AG 










ZC218 


CTC 


TGC 


CTG 


CCG 


AAC 












ZC235 


GAT 


CCA 


TGC 


AGC 


GC 












ZC249 


AGA 


ACA 


GCT 


TTG 


TTC 


TTT 


CA 








ZC275 


GCC 


CCC 


ATT 


CTG 


GCA 












ZC286 


CCA 


AAG 


AGG 


GCC 


AAC 


GCC 


TTC 


CTG 


GAG 


GAG 




CCT 


GGG 


AGC 


CTG 


GAG 


AGA 


GAG 


TGT 


ATT 


GAG 


ZC287 


AAT 


ACA 


CTC 


TCT 


CTC 


CAG 


GCT 


CCC 


AGG 


TCT 




CTC 


CAG 


GAA 


GGC 


GTT 


GGC 


CCT 


CTT 


TGG 




ZC288 


AGC 
TCG 


AGT 
AGG 


GTA 
CCA 


GCT 
GCG 


TCG 
ACG 


AGG 


AGA 


ACA 


GAG 


AGG 


ZC289 


AAT 


TCG 


TCG 


CTG 


GCC 


TCG 


AAA 


ACC 


TCT 


CTG 




TCC 


TCG 


AAG 


CTA 


CAC 


TGC 


TCC 








ZC333 


CAG 


CTT 


CGT 


CCT 


GTC 


GCT 


GGC 


CTC 






ZC336 


CCT 


CTT 


TGG 


GCC 


TGG 


TGA 










ZC360 


C 

CA 

T 


C 

TC 

T 


C 

TC 

T 


C 

TC 

T 


G 

TT 

A 


CA 










ZC401 


CGT 


AGC 


GTT 


CAG 


GCC 


CTC 


GAA 


GAT 


CTC 


GCG 




CTC 


CTC 


GAA 


GCT 


ACA 


C 
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Since genomic clone 7m 1 was known to con- 
tain 7kb of sequences upstream of exon 2, this 
clone was anticipated to encode Factor VII 5* - 
untranslated sequences and the leader sequences 
up to the amino acid position -17. In order to 



confirm that exon 1 was encoded within genom.. 
clone 7 ml, the leader sequence information from 
clones WII2463 and XVH565 was used to design 
oligonucleotides ZC528 and ZC529 (shown below). 



ZC528 



5 f TCA ACA GGC AGG GGC AGC ACT GCA GAG ATT 3 ' 



ZC529 



5 ' TTC CAC GGC ATG TCC CGT GTT TCT CCT CCT 3 ' 



These were used to probe 7m 1 DNA, and a subl- 
cone, 7SD, was found that hybridized to both 
oligonucleotides. Exon 1 was determined to be 
composed of two exonic sequences: exon 1a, 
which hybridized to ZC528 (corresponding to 
nucleotides 1 to 30 in WII2463), and exon 1b, 
which hybridized to ZC528 (corresponding to 
nucleotides 119 to 148 in XVM2463). The intron 
sequences flanking both exons 1a and 1b have 
been sequenced: 1a contains a consensus splice 
donor sequence at the 3* end of the exon, and 1b 
is flanked on each terminus with a consensus 
splice acceptor (upstream of 1b) or donor - 
(downstream of 1 b) sequence. The position of exon 
1a within genomic clone 7m 1 has been precisely 
mapped, while that of exon 1b has been mapped 
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30 



within a defined region. Exon 1b sequences are 
present in XVH2463, while XVII565 appears to be 
derived from RNA spliced between exon 1a and 
exon 2, looping out the 1 b exonic sequence. 

A variety of 7m 1 subclones in pUC and M13 
vectors were prepared to facilitate sequencing the 
remaining exons. Appropriate oligonucleotides de- 
signed from the cDNA sequences, which corre- 
spond to exons 1 through 7, were used to se- 
quence all but the last exon. The genomic se- 
quence corresponds exactly to the cDNA se- 
quences through these regions. In addition, the 
intron/exon boundaries for exons 1-7 have been 
determined, and most are now precisely mapped 
within clone 7m 1. The intron sizes and positions 
within the Factor VII gene are listed in Table 2. 



35 



TABLE 2 

Intron/Exon Junctions in the Factor VII Gene 



Intron 



Amino Acid Position 



Intron Size (Kb) 



A 


-39 


>0.2 


B 


-17 


>1.0 


C 


37/38 


1 .92 


D 


46 


0.068 


E • 


84 


-2 


F 


131 


-1 


G 


167/168 


0.56 


H 


209 


1.31 



16 



+ • 

29 0 200 421 30 



Phage 7m 1 was known to lack the Factor VII 
3*-terminus, which includes exon 8. In order to 
obtain these sequences, a 12-13 kb-enriched Bam 
HI library in XL47.1 (Loenen and Brammer, Gene 
Ifl: 249* 1980; Maniatis, et al.. ibid .), derived from 
human dermal primary fibroblast cells, was probed 
with two nick-translated Factor VII cONA Pstl frag- 
ments (corresponding to sequences in exon 7, and 
to 3'-untranslated sequences). A clone, designated 
7DC1, was detected by both probes. Subsequent 
restriction endonuclease and Southern blot analysis 
established that clone 7DC1 overlaps with, and 
extends approximately 3 Kb beyond the terminus 
of, clone 7m 1, and that it contains exon 8. The 3.9 
Kb (Xbal-BamHI) fragment from 7DC1 DNA con- 
taining exon 8 was subcloned into M13, and se- 
quence analysis was performed using 
oligonucleotides complementary to its 5* and 3* 
termini. The entire exon sequence is present in this 
clone. 

Example 4: Factor IX-Factor VII Hybrid Genes Con- 
taining a Synthesized Coding Sequence. 



A. Construction of a hybrid Factor IX leader-syn- 
thetic Factor VII 5' coding sequence. 

The second alternative for obtaining the 5' cod- 
ing sequence for Factor VII was synthesis of an 
appropriate double-stranded fragment, using a 
nucleotide sequence predicted on the basis of the 
amino terminal amino acid sequence of Factor VII, 
the amino acid sequences of other vitamin K-de- 
pendent clotting factors, and the known nucleotide 
sequences of other vitamin K-dependent clotting 
factor genes (Kurachi and Davie, ibid; Anson et al., 
EMBO J. 3: 1053-1060,1984; and Davie et al., ibid). 
In order to provide the necessary secretion and 
processing signals for secretion of a mature Factor 
VII analog, this synthetic fragment (the consensus 
sequence) was joined to one of two leader se- 
quences derived from a Factor IX cDNA clone. 
This strategy is outlined in Figure 3. 

A cDNA coding for human Factor IX was ob- 
tained from a library made with mRNA from human 
liver (Kurachi and Davie, ibid). The Factor IX se- 
quence was isolated from the pBR322 vector by 
digestion with Pst 1 and was inserted into the Pst I 
site of pUC13. This piasmid was designated FIX- 
PUC13. In order to remove the G-rich region which 
was present at the 5' end of the Factor IX insert as 
a result of cDNA cloning, a synthetic 
oligonucleotide adaptor was substituted for the 5' 
end of the cloned fragment. Oligonucleotides 
ZC212 and 2C213 (Table 1) were synthesized and 



annealed to generate a 22 base pair overlap, the 
fragment ends filled in and cut with appropriate 
restriction endonucleases, and the resulting frag- 
ment was joined to the Factor IX sequence. 

5 To construct the adaptor, 100 pmoles each of 

ZC212 and ZC213 were lyophilized and resuspen- 
ded in 10 ui of 10x kinase/ligase buffer (600 mM 
Tris pH 8.0, 100 mM MgCI„ 100 nM DTT) plus 86 
ul H 3 0. The annealing reaction was run at 65 °C for 

to 10 minutes, the mixture was slowly cooled to room 
temperature and put on ice. To this mixture was 
added 4 ul of 2.5mM dNTP mix and 1 ul (8 units) 
T« DNA polymerase. The reaction was allowed to 
proceed 45 minutes at 14°C. Ten ul of 5 M 

rs NH4OAC was then added and the DNA was ex- 
tracted once with phenol/CHCI 3 , twice with CHCI 3l 
and was precipitated with ethanot. The DNA was 
centrifuged and resuspended in 100 ul medium salt 
buffer (Maniatis et al., ibid, p. 100), digested with 9 

20 units Pst I and 8 units Cfo I, and extracted as 
above. 

The modified Factor IX sequence was then 
constructed by combining 0.16 pmoles of the syn- 
thetic Pst l-Cfo I adaptor fragment, 0.14 pmoles of 

25 "a 1.4 kb Cfo l-Bam HI Factor IX fragment from FIX- 
PUC13, and 0.14 pmoles of a 2.7 kb Bam Hl-Pst I 
PUC13 vector fragment in a 20 ul reaction contain- 
ing 60 mM Tris-HCI pH 7.5, 10 mM MgCI 2 , 10 mM 
DTT, and 0.9 units T 4 ligase. The reaction was 

30 incubated for 3 h at room temperature and used to 
transform competent E. coli JM83 (Messing, Re- 
combinant DNA Technical Bulletin, NIH Publication 
No. 79-99, 2, No. 2, 43-48, 1979). The cells were 
plated with 50 ul of 2% X-gal (5 bromo-4-chloro-3 

35 idolyl-£-D-galactoside) on L-broth containing 40 
ug/ml. ampicillin and incubated at 37°C overnight. 
White colonies were picked onto another plate con- 
taining ampicillin and grown at 37° C overnight. The 
colonies were blotted on Whatman 540 paper and 

40 the paper prepared for hybridization according to 
the method of Wallace et al. (Gene 16: 21, 1981), 
except the overnight incubation on chloramphenicol 
plates was omitted. The papers were incubated at 
44°C for 2 h in 0.9 M NaCI, 0.09M Tris-HCI pH 7.5. 

45 6 mM EDTA, 0.5% Nonidet P-40, 150 ug/ml E. 02N 
tRNA. The papers were probed with 32 P-labeled 
ZC235 (Table 1), a 14-mer that is specific for the 
altered 5 ? end sequence. Hybridization with 1-2 x 
10 6 cpm per filter was carried out at 44°C in the 

50 prehybridization buffer overnight. The filters were 
then washed 3 times in 6 x SSC, 0.1% SDS at 4°C 
and 3 times in 2 x SSC, 0.1% SDS at 44°C and 
exposed to X-ray film. Two positive clones were 
obtained. One of these clones was designated FIX 

55 (-G) - pUC13. 
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In order to confirm the sequence of the altered 
region of the Factor IX portion of the FIX(-G) — 
pUC13 construct, dideoxy sequencing directly on 
the pUC plasmid using the BRL reverse primer was 
performed using the method of Wallace et al., 1981 
(ibid) using a primer end labeled with poly- 
nucleotide kinase and y^P ATP by the method of 
Chaconas et al. (ibid). The sequence was as pre- 
dicted. 

The resulting recombinant plasmid contains 
three Hae III cleavage sites, the first at position 39 
in the Factor IX sequence (numbering is based on 
the published sequence of Anson et ai. (ibid), be- 
ginning at the first ATG), the second at position 
130, and a third in the pUCl3 poly linker. The site 
at 130 is a single base pair upstream from the 
codons for the Lys-Arg processing site of the prepo 
Factor IX molecule. In the final Factor IX-Factor VII 
hybrid constructs, the Factor IX leader sequence, 
terminated at position 39 or 130, was joined to a 
synthetic double-stranded fragment comprising the 
predicted consensus sequence and the last 3 
codons of the Factor IX leader sequence. 

The synthetic consensus fragment was pro- 
duced by joining oligonucleotides ZC286-ZC289 - 
(Table 1 ) to form a double-stranded fragment One 
hundred pmole of each oligonucleotide was 
lyophilized and resuspended in 20 ul of 1x kinase 
buffer and incubated overnight at 4°C; then heated 
at 65°C for 10 minutes. Two pools were made 
using the kinased oligonucleotides. Pool 1 con- 
tained ZC286 + ZC287; pool 2 contained ZC288 
+ ZC289. The pooled pairs were annealed 10 
minutes at 65 °C, then cooled to room temperature 
over a period of 2 hours and placed on ice for 30 
minutes. 

The modified Factor IX fragment was removed 
from FIX(-G)-* pUCl3 as a Hind lll-Eco Ri frag- 
ment. Approximately 20 ug of plasmid was di- 
gested with 30 units each of Hind III and Eco RI in 
100 ul Hind III buffer (BRL) containing 4 ug RNase 
A at 37° C overnight. The reaction was terminated 
by heating at 65 °C for 10 minutes, and the vector 
and Factor IX fragments were electrophoresed on a 
1% agarose gel and purified by electro-elution. The 
Factor IX fragment was precipitated with ethanol, 
resuspended in buffer containing 400 ng/ul RNase 
A, and digested with 9 units of Hae III overnight at 
37°C. The Hind lll-Hae HI 39 base pair Factor IX 
fragment was isolated from this digest by elec- 
trophoresis on a 1.5% agarose gel followed by 
electro-elution. To obtain the Hind lll-Hae III 130 
base pair Factor IX fragment, FIX-pUCl3 was di- 
gested with Eco RI and Hind III and the Factor IX 
fragment isolated as above. Approximately 3 ug of 
this Hind lll-Eco RI fragment was digested with 6 
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units of Hae III at 37°C and aliquots were remove 1 
at five minute intervals over 30 minutes into a 
solution containing 50 mm EDTA. The aliquots 
were pooled and the Hind lll-Hae III 130 base pair 

s fragment was purified by electrophoresis on a 5% 
acrylamide gel followed by electro-elution. 

The final Factor IX-consensus sequence hy- 
brids were prepared by joining, in a four-part liga- 
tion, oligonucleotide pools 1 and 2, Factor IX Hind 

io lll-Hae III (39 or 130 base pairs), and pUC13 Hind 
lll-Eco RI. The resulting plasmids were used to 
transform E. goto HB101 (ATCC 33694). Colonies 
were screened by digestion of DNA with Eco RI 
and Hind III. The sequence comprising the 39 base 

75 pair Factor IX sequence joined to the synthetic 
consensus sequence is hereinafter referred to as 
mini-FIX-FVII. The plasmid containing this construct 
was designated pM7200(-C). The sequence com- 
prising the 130 base pair Factor IX sequence 

20 joined to the synthetic consensus sequence is re- 
ferred to as maxi-FIX-FVII. The plasmid containing 
this construct was designated pM7100(-C). The 
consensus sequence encodes a polypeptide com- 
prising the amino acid sequence Ala-Asn-Aia-Phe- 

25 Leu-Gla-Gia-Arg-Pro-Gly-Ser-Leu-Gla-Arg-Gla-Cys- 
Lys-GJa-Gln-Cys-Ser-Phe-Gla-Gla-Ala-Arg-Gla-lle- 
Phe-Gla-Gly-Leu-Asn-Arg-Thr-Lys-Leu. 

B. Joining Factor IX-consensus sequence hybrid 
30 fragment to Factor VII cDNA clone. 

The Factor IX-consensus sequence hybrids 
(either mini or maxi) were joined to the 5* portion of 
the Factor VII cDNA and the vector pUC13 in a 

35 three-part ligation (Figures 4 and 5). The vector 
fragment was produced by digesting 6 ug of 
pUC13 with 10 units each of Xba I and Hind III in 
Hind III buffer containing RNase A (400 ng/ul). The 
mini-FIX-FVII fragment was produced by digesting 

40 2 ug of pM7200(-C) with 10 units each of Hind III 
and Eco RI as above. The maxi-FIX-FVII fragment 
was similarly prepared from pM7100(-C). The 5' 
portion of the Factor VII cDNA was prepared from a 
plasmid (pUCG705) comprising the Eco Rl-Xba I 5' 

45 fragment of pUCVII2115 subcloned into pUC13 by 
digestion with Xba I and Eco RI. Digests were run 
at 37°C for 2 hours and the products were sepa- 
rated by electrophoresis on a 1.5% agarose gel. 
The desired fragments were electro-eiuted, extract- 

50 ed with phenol/CHCI 3 and CHCL and precipitated 
with ethanol. The three fragments, pUCl3/Xba l- 
Hind III, Factor IX-Factor VII (mini or maxi)/Hind lll- 
Eco RI, and 5* Factor Vll/Eco Rl-Xba I were then 
ligated in 20 ul of ligase buffer containing 2 ul 20 

55 mM ATP and 0.9 unit T 4 DNA ligase overnight at 
4°C. Colonies were screened by restriction analy- 
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sis with Hind III and Xba I. The recombinant plas- 
mids containing the mini-and maxi-FIX-FVII se- 
quences were designated pM7200 and pM7100, 
respectively (Figure 4). 

Due to the linker addition used in producing 
the Factor VII cDNA, modifications had to be made 
in the fusion sequences to generate correct in 
frame coding sequences. Both mini-and maxi-fu- 
sions contain an Ecd Rl site at the junction be- 
tween the Factor IX-consensus sequence hybrid 
and the Factor VII cDNA which is an artifact of the 
cDNA cloning process. In addition, the mini-fusion 
requires the addition of a C to change the se- 
quence at the Hae III site from 5 AGGCCA 3 to 
5 AGGCCCA 3 ' and establish the correct reading 
frame downstream of this sequence. These correc- 
tions were made by oligonucleotide-directed site 
specific mutagenesis, essentially as described for 
the two-primer method by Zoller and Smith - 
(Manual for Advanced Techniques in Molecular 
Cloning Course , Cold Spring Harbor Laboratory. 
1983). The mini-FIX-FVII fragment was removed 
from pM7200 by digestion with Hind III and Xba I 
and inserted into M13mp19, The maxi-FIX-FVII 
fragment was purified from pM7100 and subcloned 
in a similar manner. The mutagenic primers ZC333 
and ZC336 (see Table I) were used for removal of- 
the Eco Rl site and the base insertion, respectively. 
In each case, the universal primer ZC87 was used 
as the second primer. The mutagenic primers were 
phosphorylated by combining 40 pmoles of primer 
and 60 pmoles ATP with 1 unit of T 4 DNA kinase 
overnight at 60°C. To remove the Eco Rl site from 
the maxi-FIX-FVII hybrid, 1 ug of the M13 single- 
stranded template was combined with 20 pmoles 
each ZC333 and ZC87 in a total volume of 10 ul. 
The primers were annealed to the template for 10 
minutes at 65 °C, cooled to room temperature for 5 
minutes, then placed on ice for 5 minutes. The 
primers were extended using DNA polymerase I - 
(Klenow fragment). To remove the Eco Rl site and 
correct the reading frame in the mini-FIX-FVII hy- 
brid, 1 ug of the appropriate M13 single-stranded 
template was combined with 20 pmoles each 
ZC333,. ZC336 and ZC87. Annealing and primer 
extension reactions were carried out as described 
above. Plaque lifts were screened with 32 P-labeled 
primer (ZC333 or ZC336) at 60 °C and sequences 
confirmed by dideoxy sequencing. The resultant 
constructs, comprising the maxi-and mini-FIX-FVII 
sequences, were designated pM71H and pM721l, 
respectively. 

The consensus sequence contains several re- 
gions which do not conform to the protein se- 
quence data obtained for Factor VII (Figure 2). In 
order to produce a sequence which encodes a 
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polypeptide with greater homology to the amino- 
terminal portion of Factor VII, the consensus se- 
quence was altered by oligonucleotide-directed 
site-specific mutagenesis. The changes made were 

5 the insertion of Leu at position 8, substitution of lie 
'for Lys at position 18 (numbers refer to the amino 
acid position after the inseertion at position 8), Asn 
for Ala at position 26, and the sequence Ala-Ser- 
Asp for Gly-Leu-Asn at positions 32-34 (based on 

io tentative amino acid sequence data). 

The sequence changes at positions 8 and 18 
were made using pM7lH (sense strand) as tem- 
plate. Primers ZC352 ( 5> CCC AGG TCT CAG CTC 
CTC CAG 3 ) and ZC353 ( 5 CTG CTC CTC CTT 

75 ACA CTC TCT 3 ' ) were annealed to the template 
and extended as described above. The resultant 
phage clone was designated pM7114. The se- 
quence of the insert in pM7114 was confirmed by 
dideoxy sequencing. 

20 In a similar manner, the changes at positions 

26-34 were made on the pM7114 template (sense 
strand) using the mutagenic primer ZC366 ( 5 CAG 
CTT CGT CCT GTT CAG GCC CTC GAA GAT 
CTC GCG GGC CTC CTC GAA 3 ) and ZC87 (Table 

25 1) as second primer. The resultant construct was 
designated pM7H5. The sequence of the entire 
550 bp insert in the M13 vector was determined by 
dideoxy sequencing and found to be correct. 

30 Example 5: Construction of Factor IX-Factor VII 
cDNA fusion. 

The Factor IX-Factor VII cDNA fusion was pre- 
pared using Factor IX cDNA obtained from a hu- 

35 man liver cDNA library as described by Kurachi 
and Davie (ibid) and the Factor VII cDNA sequence 
described in Example 1 . 

The fusion point chosen for the hybrid protein 
was between amino acid + 38 (threonine) of Factor 

40 IX and the first lysine encoded by the Factor VII 
cDNA sequence. Such a protein would be encoded 
by a sequence consisting of the first 252 bp of the 
Factor IX cDNA sequence and all of the 
pUCVII2115 Factor VII cDNA sequence except the 

45 first two codons. To construct this hybrid se- 
quence, the Factor IX sequence was first fused to 
pUCVII2115 using convenient restriction sites. This 
fusion resulted in the plasmid FIX/VII/12 (described 
below) which contains the first 310 bp of the Factor 

so IX cDNA joined to the entire Factor VII cDNA 
sequence. To achieve the precise junction desired 
for the hydrid protein, the intervening base pairs 
were removed by oligonucleotide-directed 
mutagenesis. 

55 
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t Joining of the Factor IX cDNA sequence to 
Factor VII cDNA sequence was accomplished by 
ligating a 0.3 kb Hind Ill-Ana III fragment of FIX (- 
G) — pUC!3 (Example 4) to a 4.7 kb Sma l-Hind III 
fragment from pUCVII2l15 (Figure 5). The Hind Ill- 
Ana III fragment was prepared by digesting 3 ug of 
FIX(-G) — PUC13 with 40 units of Hind III in 40 ul of 
medium salt buffer (Maniatis et al., ibid) at 37°C, 4 
hours. The volume was then increased to 100 ul of 
medium salt buffer, and 5 units of Aha 111 were 
added and the 37°C incubation continued for 18 
hours. The DNA fragments were separated by elec- 
trophoresis in 1% agarose and the 0.3 kb band 
isolated as described above. A Sma I parital diges- 
tion of pUCVII2115 was obtained by incubating 3 
ug of pUCVII2115 at 25°C for 1 hour with 4.8 units 
of Sma I in a reaction volume of 30 ul. The reaction 
was stopped by a 15-minute incubation at 65°C. 
The sample was then extracted once with an equal 
volume of phenol and ethanol precipitated. 

The precipitate was collected by a 10-minute 
microfuge spin, rinsed with 70% ethanol and air 
dried. The DNA was redissolved in 30 ul of me- 
dium salt buffer and digested with 30 units of Hind 
111 at 37°C for 3 hours. The DNA was subjected to 
electrophoresis in 0.7% agarose and the 4.7 kb 
Hind Ill-Sma I fragment isolated as described 
above. Equimolar amounts of the two fragments 
(0.048 pmoles) were ligated in a 10 ui reaction 
containing 50 mM Tris-HCI pH 7.5, 10 mM MgCI 2 . 
1 mM DTT, 1 mM ATP, and 3 units of T 4 DNA 
ligase at 14°C for 3.5 hours and then used to 
transform competent E. coH RRI (ATCC 31343). 
The cells were grown on ampicillin plates and 12 of 
the resulting colonies were screened by restriction 
enzyme digestion for the presence <of the desired 
plasmid construction. DNA from colony 12 
(FIX/VII/12) gave the expected restriction enzyme 
digestion pattern and was used in the next step of 
the hybrid gene construction. 

The oligonucieotide-directed mutagensis proce- 
dure was performed on a single-stranded DNA 
template. Thus, it was necessary to clone the fused 
Factor IX/Factor VII sequences into M13mp19. To 
obtain a conveniently small DNA fragment, a 640 
bp Hind Ill-Xba I fragment was isolated from 
FIX/VII/12. This fragment contains 310 bp of the 5' 
end of Factor IX cDNA and 330 bp of the Factor VII 
sequence. The vector was prepared by digesting 1 
ug of M13mp19 RF DNA with 20 units of Hind III 
and 20 units of Xba I in 40 ui of medium salt buffer 
at 37°C for 18 hours. The DNA was subjected to 
electrophoresis in 1.2% agarose and the linear 6.4 
kb fragment isolated from the gel as described 
above. Five ug of FIX/VII/12 DNA was digested with 
10 units of Xba I in 40 ul of medium salt buffer at 
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37° for 18 hours. Twenty units of Hind III wer^ 
added and the digestion continued at 37°C for an 
additional 7 hours. The resulting fragments were 
separated by electrophoresis in 1.2% agarose and 

5 the 640 bp fragment eluted as above. Ten ng of 
linearized m13mp19 and 1 ng of the 640 bp frag- 
ment were ligated at 14°C for 1 hour and then 
used to transform competent £. sfili JM101 - 
(Messing, Meth. in Enzvmoloav . ibid). The cells 

70 were plated with X-gal and IPTG (Messing, Meth. in 
Enzvmoloav . ibid) and eight light blue plaques 
were picked and used to infect 2.5 ml cultures of 
E. £0H JM103 at Aw = 0.3. After 18 hours 1 growth 
at 37°C, the cells were harvested by centrifugation 

75 in a room temperature clinical centrifuge and 20 ul 
of the supernatant which contains the M13 phage 
was mixed with 10 ug/1 ethidium bromide. By 
comparison with known standards, each of the 
eight clones had an insert of approximatley the 

20 correct size. Single-stranded DNA was then pre- 
pared from 1 .5 ml of the supernatants as described 
by Messing (Meth. in Enzvmoloav , ibid). This con- 
struct was then sequenced by the dideoxy method 
using the oligonucleotide ZC87 as a primer to 

25 confirm that the insert junction was correct. One of 
the correct clones (#4) was used as a template in 
oligonucieotide-directed mutagenesis to produce a 
functional Factor IX-Factor VII fusion. 

The oligonucleotide ZC249, a 20-mer consist- 

30 ing of 10 bp of the desired Factor IX sequence and 
10 bp of the desired Factor VII sequence (Table I) 
was used as the mutagenic primer. The 
oligonucleotide ZC87, which hybridizes to the 
M13mp19 sequence, was used as the second 

35 primer. 

The mutagenesis procedure was modified from 
that of Zoller and Smith (ibid). For the annealing 
reaction, 20 pmoles of ZC249 were phosphorylated 
by incubating overnight at 4°C in 20 ul 60 mM 

40 Tris-HCI pH 8.0, 10 mM MgCI*. 1 mM DTT, 1mM 
ATP, 1 unit T« kinase. The reaction was stopped by 
incubation at 65°C for 15 minutes, and the sample 
was lyophlized. One pmole of single-stranded 
clone #4 template and 20 pmole of ZC87 were 

45 added in 10 ul annealing buffer (200 mM Tris-HCI 
pH 7.5, 100 mM MgCI 2 , 500 mM NaCI, 10 mM 
DTT). The sample was heated to 65 °C for 10 
minutes, incubated at room temperature for 5 min- 
utes, and then placed on ice. Ten ul of the follow- 
so ing solution was prepared fresh and added to the 
sample: 20 mM Tris-HCI pH 7.5, 10 mM MgCI 2 , 10 
mM DTT, 1 mM dNTPs, 1 mM ATP, 0.15 units/ul 
T 4 DNA ligase, 0.25 units/ul E. co}i DNA Poly- 
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merase I (Klenow fragment). The reaction was then 
incubated at 15°C for 3 hours and the sample used 
to transform competent E. £2l[ JM101 (Messing, 
Meth. in Enzvmoloav , ibid). 

The resulting plaques were fitted onto nitrocel- 
lulose and screened by hybridization to ^P-labeled 
ZC249. Dry BA85 filters (Schliecher & Schuell. 0.45 
urn) were laid onto the agar plate and the phage 
allowed to adsorb for 5 minutes." The filters were 
removed and allowed to dry for 5 minutes, placed 
on Whatman 3 MM paper, saturated in 0.5 M 
NaOH, 1.5 M NaCI for 5 minutes, air dried for 3 
minutes, placed on Whatman paper, saturated in 1 
M Tris-HCI pH 8, 1.5 M NaCI, for 5 minutes, and 
air dried for 3 minutes. The Tris-HCI step was 
repeated and the filters were rinsed in 100 ml 6 x 
SSC for 2 minutes at room temperature. After air 
drying, the filters were baked at 80°C for 2 hours 
and prehybridized at 47° C (T m -4° of ZC249) over- 
night in 6.7 x SSC pH 6.5. 2 mg/ml E. £Oli tRNA, 
and 0.2% (w/v) each BSA, Ficoll, and polyvinyl- 
pryrolidine. ; 

After the prehybridization step, the filters were 
incubated with 2.5 x 10 6 cpm/filter of labeled 
ZC249 in the same SSC hybridization buffer at 
47 °C overnight. Following hybridization, the filters 
were washed 3 times, 5-10 minutes each, at room 
temperature in 6 x SSC and exposed to X-ray film. 
Putative positive plaques were replated and 
screened as above. Individual plaques were then 
picked, and single-stranded DNA was prepared and 
sequenced using ZC275 as a primer. The 
oligonucleotide ZC275 corresponds to a sequence 
40 bp in the 5* direction of ZC249 on the same 
strand (Tablel). 

Four positive plaques were identified. The en- 
tire insert in M13mp19 for one clone (FIX/VII-9) was 
sequenced by the dideoxy method using the 
oligonucleotides ZC87 and ZC275 and determined 
to be correct. The confirmed sequence is repre- 
sented by bases 1-567 in Figure 7. RF DNA from 
this clone was then used for the final step in the 
construction of the hybrid gene. 

Three fragments were used to make the final 
construction: the 0.6 kb Hind Ill-Xba I fragment 
from FIX/VII-9 containing the fused IX/VII se- 
quences; a 1.7 kb Xba l-Bam HI Factor VII cDNA 
fragment from pUCVII1923; and a 2.7 kb Bam Hl- 
Hind III fragment of pUCl3. Three ug of FIX/VII-9 
(RF DNA) were digested at 37° C for 6 hours with 
45 units of Xba I in a volume of 50 ul. The DNA 
was precipitated with ethanol, resuspended and 
digested at 37°C for 4 h with 50 units of Hind III. 
The sample was subjected to electrophoresis in 



1% agarose and the 0.6 kb band electro-el uted 
from the paper with 1.5 M NaCl t 50 mM Tris-HCI 
pH 8, 1 mM EDTA, phenol extracted and precipitat- 
ed with ethanol. 

5 To obtain the remaining Factor VII cDNA se- 

quence, 5 ug of PUCVII1923 was digested at 37°C 
for 3 hours with 36 units of Xba I in 40 ul of 
medium salt buffer. Then 8 ul of 10x high salt 
buffer, 28 ul of H 2 0, and 4 ul (40 units) of Bam HI 

to were added and the reaction incubated at 37°C for 
3 hours. The DNA fragments were separated by 
electrophoresis in 1 % agarose and the 1 .7 kb frag- 
ment isolated as described above. 

The vector fragment was prepared by digesting 

75 1 ug of pUCl3 with 10 units of Hind III in 20 ul of 
medium salt buffer at 37°C for 1 hour. Two ul of 
10x high salt buffer and 10 units of Bam HI were 
then added and the incubation continued for an- 
other 2 hours. The DNA was purified on a 1% 

20 agarose gel as described above. 

Equimoiar amounts (approximately 0.56 
pmoles) of the three fragments were ligated at 
room temperature for 45 minutes in 10 ul of 50 mM 
Tris-HCI pH 7.5, 10 mM MgCI*. 1 mM DTT, 1mM 

25 ATP and 3 units T» DNA ligase. The reaction mix- 
ture was used to transform competent E. coH 
JM83. The cells were plated on medium containing 
40 ug/ml ampicillin with 50 ul of 2% X-gal added to 
each plate. DNA was prepared from 7 white colo- 

30 nies and then screened by restriction enzyme di- 
gestipn^One of the clones giving the correct pat- 
tern was designated FIX/VII — pUC13. 

Example 6: Expression of Biologically Active Factor 
35 VII Analogs. 

The mammalian cell expression vector pD2 
was chosen for expression of the FIX/VII gene in 
transfected animal cells. It was constructed from 

40 plasmid pDHFR-lll (Berkner and Sharp, Nuc. Acids 
Res. 13: 841-857, 1985) in the following manner. 
The Pst I site abutting the DHFR cDNA in pDHFR 
III was converted to a Bam HI site by conventional 
tinkering (Scheller, R. H., Dickerson, R.E., Boyer, H. 

45 W. t Riggs, A.D., and Itakura, K., Science 196 : 177- 
180, 1977). The pDHFR III DNA was incubated with 
10 mM Tris pH 7.6, 6 mM /3-MSH, 6 mM NaCI, 10 
mM MgCI, and 2.5 units Pst I for 10 minutes at 
37 °C, followed by phenol extraction and ethanol 

so precipitation. The Pst I cohesive termini were blunt 
ended using T« DNA polymerase. After phenol ex- 
traction and dialysis against 10 mM Tris pH 8.0, 1 
mM EDTA, 0.3 M NaCI, the DNA was ethanol 
precipitated. The DNA was resuspended in 20 ul 

55 1.4 mM ATP, 50 mM Tris pH 7.6. 10 mM MgCI,. 1 
mM dithiothreitol and then incubated with 5 ng of 
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T 4 polynucleotide kinase-treated Bam HI linkers - 
(New England Biolabs) and 200 units of T 4 poly- 
nucleotide ligase for 12 hours at 12°C, followed by 
phenol extraction and ethanol precipitation. The 
DNA was digested with 90 units of Bam HI at 37° C 
for 1 hour, followed by electrophoresis through a 
1.4% agarose gel. The 4.9 kb DNA fragment 
(corresponding to pDHFR III DNA lacking the 
DHFR cDNA and SV40 polyadenylation signal) was 
electro-eluted and recircularized with poly- 
nucleotide ligase and then transfected into E. £2!i 
HB101. Ampiciliin-sensitive colonies were screened 
by rapid prep analysis (Birnboim, H.C., and Doly, 
J., Nucleic Acids Research 7: 1513-1523, 1979) 
and the correct clone was grown up to generate a 
large-scale plasmid DNA preparation. 

The resultant plasmid was cleaved with 20 
units Bam HI and treated with 2.5 ug calf intestinal 
phosphatase and electrophoresed on a 1.4% 
agarose gel. Twenty-five ug of pSV40 (a clone of 
SV40 DNA inserted into the Bam HI site of 
pBR322) were digested with 25 units of Bel I for 1 
hour at 50°C, followed by the addition of 25 units 
of Bam HI, and the incubation continued for 1 hour 
at 37°C. This DNA was then electrophoresed on a 
1.4% agarose gel. The Bam Hl-cut vector (i.e., that 
lacking the polyadenylation signal) was joined to 
the SV40 DNA fragment (.14 to .19 map units - 
[Tooze, J., ed., "DNA Tumor Viruses, Molecular 
Biology of Tumor Viruses"]) containing the late 
polyadenylation signal by incubating the gel-puri- 
fied fragments (0.1 ug each) in 20 ul 50 mM Tris 
pH 7.6, 10 mM MgCI 2 , 1 mM dithiothreitol, 1.4 mM 
ATP and 100 units T 4 polynucleotide ligase for 4 
hours at 12°C, followed by transformation into E. 
coli RR1 . Positive colonies were identified by rapid 
prep analysis, and a large-scale plasmid prepara- 
tion of the correct DNA, pD2, was prepared. 

To make the Factor IX/VII expression construc- 
tion, 1 ug of pD2 was digested at 37°C for 1 hour 
with 20 units of Bam HI in 20 ul of high salt buffer. 
Twenty ul of 10 mM Tris-HCI pH 8, 1 mM EDTA 
and 0.1 unit of calf alkaline phosphatase - 
(Boeringer) were then added. The reaction was 
incubated at 37 °C for 1 hour and stopped by 
heating to 75 °C for 10 minutes. Ten ug of FIX/VH 
— pUC13 was digested at 37°C for 2 hours with 
150 units of Bam HI in 150 ul of high salt buffer. 
The DNA fragments were separated by elec- 
trophoresis in 1.2% agarose and the 2.3 kb frag- 
ment was isolated. Equimolar amounts (0.015 
pmoles) of the 2.3 kb Bam HI fragment and the 
pD2 vector fragment were ligated at 14°C for 2.5 
hours as above. The reaction mixture was used to 
transform E. coli RR1 cells, which were then plated 
on medium containing 10 ug/ml ampicillin. Plasmid 



DNA was prepared from 12 of the resulting coio 
nies and screened by restriction enzyme digestion. 
One of the clones with the correct enzyme diges- 
tion pattern was designated FIX/VII/pD2 (Figure 6). 
5 E. £0|i RR1 transformed with FIX/VII/pD2 has been 
deposited with ATCC under accession number 
53068. 

The procedure used to transfect baby hamster 
kidney (BHK) cells (available from 'American Type 

io Culture Collection, accession number CCL10) with 
FIX/VII/pD2 was similar to published methods (for 
example, Wigler et al.. CeU 14: 725, 1978; Corsaro 
and Pearson, Somatic Cell Genetics 7: 603, 1981; 
Graham and Van der Eb, Virology 52: 456, 1 973). 

75 The BHK cells were grown at 37° C, 5% CO,, in 
Dulbecco's media (plus 10% heat-inactivated fetal 
calf serum and supplemented with glutamine and 
penicillin-strep-tomycin) in 60 mm tissue culture 
Petri dishes to a confluency of 20%. A total of 10 

20 ug DNA was used to transfect one 60 mm dish: 
3.75 ug of FIX/VII/pD2, 1.25 ug of pKO-neo - 
(Southern and Berg, J. Mol. Aopl. Genet 1: 327- 
341, 1982) and 5 ug of salmon sperm DNA. The 
DNAs were precipitated in 0.3 M NaOAc, 75% 

25 ethanol, rinsed with 70% ethanol and redissolved in 
20 ul 10 mM Tris-HCI pH 8, 1 rrfM EDTA. The DNA 
was combined with 440 ul H,0 and 500 ul of 280 
mM NaCI, 1.5 mM NaHPO*. 12 mM dextrose, 50 
mM HEPES pH 7.12. Sixty ul of 2 M CaCI, were 

30 added dropwise to the above mixture and the solu- 
tion let stand at room temperature for 30 minutes. 
The solution was then added to the cells and the 
cells returned to 37°C for 4 hours. The medium 
was removed and 5 ml of 20% DMSO in Dul- 

35 becco's with serum were added for 2 minutes at 
room temperature. The dish was then washed rap- 
idly with 2 changes of medium and incubated in 
fresh medium overnight. Twenty-four hours after 
the DNA was added, the medium was removed and 

40 selective medium added (10 mg/ml of G418, 498 
ug/mg, Gibco, in Dulbecco's with serum). After 10 
and 13 days, individual clones, representing cells 
that had incorporated the pKO-neo gene and were 
thus resistant to G418, were transferred to 96-welt - 

45 (or 24-well) plates and grown up for protein assays. 

Cells were grown in Dulbecco's plus 10% fetal 
calf serum containing 5 ug/ml vitamin K - 
(Phytonadione, Merck). The medium was separated 
from the cells and cellular debris by centrifugation, 

so and assayed for Factor VII polypeptide (by ELISA) 
and for biological activity. The cells were removed 
from the plates with trypsin, washed with fresh 
medium, centrifuged, and frozen at -20° C. The cell 
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pellets were then thawed in PBS, pelleted, and 
resuspended in PBS containing 0.25% Triton X- 
100. Samples were diluted and assayed for poly- 
peptide and activity. 

The ELISA for Factor VII was done as follows. 
Two hundred microliters of a monoclonal antibody 
against human Factor VII (5 ul/ml in 0.1 M Na,C0 5 
pH 9.6) were incubated in each well of a 96-well 
microtiter plate 2 hours at 37° C. The wells were 
then incubated with 220 ul of 1% bovine serum 
albumin (BSA) and 0.05% Tween 20 in PBS pH 7.2 
2 hours at 37°C. The plates were rinsed with H 2 0, 
air dried, and stored at 4°C. To assay samples, 
200 ul samples were incubated 1 hour at room 
temperature in the antibody-coated wells. The wells 
were then rinsed four times with 200 ul PBS con- 
taining 0.05% Tween 20. The wells were then 
incubated for 1 hour at room temperature with 200 



70 



75 



ul of an IgG fraction of rabbit polyclonal antiserum 
against Factor VII (5 ug/ml in PBS containing 1% 
BSA and 0.05% Tween 20). This was followed by 
incubation with goat anti-rabbit IgG coupled to al- 
kaline phosphatase. The wells were then rinsed 
four times with PBS containing 0.05% Tween 20. 
To the wells were added 200 ul p-nitrophenyf phos- 
phate (30 mg) dissolved in diethanolamine buffer - 
(96 ml per liter) pH 9.8 containing 56 mg/l MgCI,. 
The enzyme reaction was done at 37 °C and the 
development of a yellow color was monitored at 
405 nm using an ELISA plate reader. Results ob- 
tained for cell media are given in Table 3. 

Factor VII biological activity was assayed by 
the one-stage clotting assay described by Quick • 
(Hemorraaic Disease and Thrombisis . 2nd ed., Leat 
Febiger, Philadelphia, 1966). Results obtained for 
cell media are given in Table 3. 
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TABLE 3 



Day 
1 



Cells/ml 
(xlO" 4 ) 

2.9 
2.7 



Factor VII 
polypeptide ng/ml 



25 



Factor VII 
activity (ng/ml) 



6.0 



1.9 
2.8 



47 



15.9 



1 .96 
2.26 



160 



93 



4.71 
4.14 



550 



300 



8.79 
11 .28 



725 



531 



5.1 
8.4 



975 



600 
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J J Example 7: Expression of Factor IX 

Fourteen ug of FIX(-G) — pUCl3 were di- 
gested with 30 units of Bam HI in 30 ul of high salt 5 
buffer for 3 hours at 37°C. The DNA was then 
subjected to electrophoresis in 1% agarose and the 
1.4 kb band confining the Factor IX sequence was 
isolated from the gel. 

Three ug of the vector pD2 were digested with io 
30 units of Bam HI in 30 ul high salt buffer for 3 
hours at 37°C. The DNA was subjected to elec- 
trophoresis in 1% agarose and the linear 1.5 kb 
fragment isolated. The DNA was then treated with 
0.12 units calf alkaline phosphatase in 3Q ul of 10 75 
mM Tris-HCI pH 8, 1 mM EDTA for 30 minutes at 
37°C. The salt was adjusted to 0.3 M NaOAc and 
the sample extracted twice with phenol, once with 
chloroform and the DNA was ethanol precipitated. 
The pellet was rinsed in 70% ethanol, dried and 20 
redissolved in 20 ul 10 mM Tris-HCI pH 8, 1 mM 



EDTA. Equimolar amounts (0.02 pmoles) of the two 
fragments were ligated with 10 units of T* DNA 
ligase as described above. The reaction mixture 
was used to transform E coli RR1 cells. DNA from 
twelve of the resulting ampicillin-resistant colonies 
was screened by restriction enzyme digestion. One 
of the clones with the 1.4 kb fragment inserted in 
the correct orientation was designated as FIX(-G)- 
/pD2. E £Oli RR1 transformed with FIX(-G)/pD2 has 
been deposited with ATCC under accession num- 
ber 53067. 

BHK cells were co-transfected with FIX(-G)/pD2 
and pKO-neo as described above. Drug-resistant 
cells were selected and prepared for ELISA and 
activity assay as described in Example 6. 

The assay for biological activity is based on 
the ability of Factor IX to reduce the clotting time 
of plasma from Factor IX-deficient patients to nor- 
mal. It was done as described by Procter and 
Rapaport (Amer. J. Clin. Path. 36: 212, 1961). Re- 
sults are shown in Table 4. 



TABLE 4 



Factor IX 
Cells/ml polypeptide (ng/ml) 
Day (xlO-4 ) supernatant pellet 



Factor IX 
activity (ng/ml) 
in supernatant 



% active 
protein in 
supernatant 



1 .65 



2 .66 



57 
45 



20 
20 



27 
24 



50% 



9 .69 



150 
120 



60 
60 



72 
84 



58% 



14.79 



475 
225 



160 
140 



198 
150 



50% 



50.85 



875 
1000 



250 
260 



408 
438 



45% 



The amount of Factor IX polypeptide was de- 
termined by ELISA essentially as described in Ex- 
ample 6 using polyclonal rabbit antisera to Factor 
IX. Following the incubation of the wells with the 
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Factor IX-containing samples, the wells were rinsed 
and incubatedl hour at room temperature with 200 
ul of affinity purified rabbit polyclonal anti-Factor IX 
conjugated to alkaline phosphatase diluted 1:1000 
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in PBS containing 1% BSA and 0.05% Tween 20. 
The wells were then rinsed four times with PBS 
containing 0.05% Tween 20, and enzyme substrate 
was added as above. Incubations were run at 4°C 
overnight or 37° C for 2 hours. 

As shown in Table 4, 70-80% of the Factor IX 
polypeptide is secreted into the media, and about 
50% of this is biologically active. No Factor IX 
activity was detected in the cell pellets. 

Highest levels of activity were achieved by 
supplementing the cell culture medium with vitamin 
K (phytonadione, Merck) at concentrations of 1-10 
mg/ml. 

Several additional analyses were performed to 
demonstate that the cells were secreting authentic 
Factor IX. Samples containing Factor IX activity 
according to the above assay were incubated with 
Factor Vlll-deficient plasma but did not affect the 
clotting time, indicating that the activity was due to 
authentic Factor IX rather than a non-specific clot- 
ting agent. This conclusion was further verified by 
depletion of Factor IX activity from the samples 
with a specific antibody. Ninety-seven to ninety- 
eight percent of the Factor IX activity was im- 
munoprecipitated from cell supernatants with a rab- 
bit polyclonal antibody against Factor IX. This anti- 
body also precipitated over 99% of the Factor IX 
activity from normal plasma. No Factor IX activity 
was removed from the supernatants by rabbit poly- 
clonal antibody to erythropoietin. 

Example 8: Construction of an expression vector 
for Factor VII. 

An expression vector comprising the synthetic 
Factor VII 5 1 coding region joined to the partial 
Factor VII cDNA was constructed. The vector, des- 
ignated pM7135 t was generated by inserting the 
Factor IX leader -5' Factor VII sequence from 
PM7115 and the 3* Factor VII sequence from 
FIX/VII/pD2 into plasmid pD3, which comprises the 
SV40 enhancer and the adenovirus 2 major late 
promoter and tripartite leader. 

Plasmid pD3 was generated from plasmid 
pDHFRIII. The Pst I site immediately upstream 
from the DHFR sequence in pDHFRIII was con- 
verted to a Bel I site by digesting 10 ug of plasmid 
with 5 units of Pst I for 10* at 37°C in 100 ul buffer 
A (10 mM Tris pH 8, 10 mM MgCI„ 6 mM NaCI, 
7mM 0-MSH). The DNA was phenol extracted. 
EtOH precipitated, and resuspended in 40 ul buffer 
B (50 mM Tris pH 8, 7 mM MgCU, 7 mM 0-MSH) 
containing 10 mM dCTP and 16 units T4 DNA 
polymerase and incubated at 12°C for 60 minutes. 
Following EtOH precipitation, the DNA was ligated 
to 2.5 ug kinased Bel I linkers in 14 ul buffer C (10 



mM Tris pH 8, 10 mM MgCI, f i mM DTT t 1.4 mM 
ATP) containing 400 units T4 polynucleotide ligase 
for 12 hours at 12°C. Following phenol extraction 
and EtOH precipitation, the DNA was resuspended 

5 in 120 ul buffer D (75 mM KCI. 6 mM Tris pH 7.5. 
10 mM MgCI,, 1 mM DTT), digested with 80 units 
Bel I for 60 minutes at 50°C, then electrophoresed 
through agarose. Form HI plasmid DNA (10 ug) was 
isolated from the gel, and ligated in 10 ul buffer C 

to containing 50 units T4 polynucleotide ligase for 2 
hours at 12°C, and used to transform E coli, 
HB101. Positive colonies were identified by rapid 
DNA preparation analysis, and plasmid DNA - 
(designated pDHFR*) prepared from positive colo- 
rs nies was transformed into dAM" E. coli . 

Plasmid pD2* was then generated by cleaving 
pDHFR* (15 ug) and pSV40 (25 ug) in 100 ul buffer 
D with 25 units Bel I for 60 minutes at 50°C, 
followed by the addition of 50 units Bam HI and 

20 additional incubation at 37 °C for 60 minutes. DNA 
fragments were resolved by agarose gel elec- 
trophoresis, and the 4.9 kb pDHFR' fragment and 
0.2 kb SV40 fragment were isolated. These frag- 
ments (200 ng pDHFR' DNA and 100 ng SV40 

25 DNA) were incubated in 10 ul buffer C containing 
100 units T4 polynucleotide ligase for 4 hours at 
12°C, and the resulting construct (pD2 f ) used to 
transform E. c^Si RRI. 

Plasmid pD2* was modified by deleting the 

30 "poison" sequences in the pBR 322 region (Lusky 
and Botcham, Nature 293 : 79-81. 1981). Plasmids 
pD2* (6.6 ug) and pML-1 (Lusky and Botcham. 
ibid) (4 ug) were incubated in 50 ul buffer A with 10 
units each Eco Rl and Nru I for 2 hours at 37° C, 

35 followed by agarose gel electrophoresis. The 1.7 
kb pD2* fragment and 1 .8 kb pML-1 fragment were 
isolated and ligated together (50 ng each) in 20 ul 
buffer C containing 100 units T4 polynucleotide 
ligase for 2 hours at 12°C, followed by transforma- 

40 tion into E. coli HB101. Colonies containing the 
desired construct (designated A pD2) were iden- 
tified by rapid preparation analysis. Ten ug of A 
pD2 were then digested with 20 units each Eco Rl 
and Bgl II, in 50 ul buffer A for 2 hours at 37° C. 

45 The DNA was electrophoresed through agarose, 
and the desired 2.8 kb fragment (fragment C) com- 
prising the pBR322, 3' splice site and poly A 
sequences was isolated/ 

To generate the remaining fragments used in 

50 constructing pD3, pDHFRIII was modified to con- 
vert the Sac II (Sst II) site into either a Hind III or 
Kpn I site. Ten ug pDHFRIII were digested with 20 
units Sst II for 2 hours at 37°C, followed by phenol 
extraction and ethanol precipitation. Resuspended 

55 DNA was incubated in 100 ul buffer B containing 
10 mM dCTP and 16 units T4 DNA polymerase for 
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60 minutes at 12°C, phenol extracted, dialyzed, 
and ethanol precipitated. DNA (5 ug) was ligated 
with 50 ng kinased Hind HI or Kpn I linkers in 20 ul 
buffer C containing 400 units T4 DNA ligase for 10 
hours at 12°C. phenol extracted, and ethanol 5 
precipitated. After resuspension in 50 ul buffer A, 
the resultant plasmids were digested with 50 units 
Hind III or Kpn I, as appropriate, and electrophores- 
ed through agarose. Gel-isolated DNA (250 ng) was 
ligated in 30 ul buffer C containing 400 units T4 io 
DNA ligase for 4 hours at 12°C and used to 
transform E. goM RRI. The resultant plasmids were 
designated pDHFRIII (Hind III) and pDHFRIII (Kpn 
I). A 700 bp Kpn l-Bgl II fragment (fragment A) was 
then purified from pDHFRIII (Kpn I) by digestion 75 
with Bgl II and Kpn I followed by agarose gel 
electrophoresis. 

The SV40 enhancer sequence was inserted 
into pDHFRIII (Hind III) as follows: 50 ug SV40 DNA 
was incubated in 120 ul buffer A with 50 units Hind 20 
III for 2 hours at 37°C, and the Hind III C SV40 
fragment (5089-968 bp) was gel purified. Plasmid 
pDHFRIII (Hind III) (10 ug) was treated with 250 ng 
calf intestinal phosphatase for 1 hour at 37°C, 
phenol extracted and ethanol precipitated. The lin- 25 
earized plasmid (50 ng) was ligated with 250 ng 
Hind III C SV40 in 16 ul buffer C for 3 hours at 
12°C. using 200 units T4 polynucleotide ligase, 
and transformed into E. £oli HB101. A 700 base 
pair Eco Rl-Kpn I fragment (fragment B) was then 30 
isolated from this plasmid. 

For the final construction of pD3, fragments A 
and B (50 ng each) were ligated with 10 ng frag- 
ment C with 200 units T4 polynucleotide ligase for 
4 hours at 12°C, followed by transfection of E. £0H 35 
RRI. Positive colonies were detected by rapid prep- 
aration analysis, and a large-scale preparation of 
pD3 was made. 

Expression vector pM7135 was then construct- 
ed. The replicative form of pM7H5 was digested 40 
with Bam HI and Xba I and the 550 base pair 
fragment comprising the Factor IX leader and 5 f 
Factor VII sequence was gel purified. Plasmid 
FIX/VII/pD2 was digested with Xba I and Bam HI 
and the 1700 bp fragment comprising the 3* portion 45 
of the Factor VII cDNA was gel purified. Plasmid 
pD3 was digested with Bel I, treated with calf 
alkaline phosphatase, and the three fragments 
joined in a triple ligation. The resultant constructs 
were screened for the presence of a 2000 base 50 
pair Xba I fragment. A plasmid having the correct 
orientation was selected and designated pM7135 - 
(Figure 8). 
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Example 9: Expression of Factor VII From cDNm 
Clones. 

In order to express Factor VII cDNA containing 
a Factor VII leader, DNA from WU2463 or XVII565 
and XVII2463 was cloned into an expression vector 
containing the Ad2 major late promoter, SV40 en- 
hancer sequences, the Ad2 tripartite leader, a 
splice set and the SV40 polyadenylation signal. 
This vector was adapted so that it contains a 
unique EcoRI sequence as the site of cDNA inser- 
tion. The expression of sequences from XVII2463, 
which encodes a 60 amino acid leader, and from 
XVII565 and XVII2463, which lacks the codons for 
amino acids from -18 to -39 and thus encodes a 
leader 38 amino acids in length, were evaluated. 
Because the structure of the Factor VII leader has 
only been identified by cDNA cloning, and because 
of the ambiguity generated by having obtained two 
different 5 f -terminal cDNAs, the inventors also 
constructed a genomic-cDNA Factor VII sequence. 
The y portion of XVII2463 (from the Bgl II site in 
exon 2, to the EcoRI site linkered 3' to the poly(A) 
tail) was adjoined to a subgenomic fragment of 
clone 7m1 t that encoding exons 1a, 1b and the 
remainder of exon 2. This subgenomic fragment, 
reconstructed as an EcoRI-Bglll 4.4 Kb fragment, 
was adjoined to the XVII2463 cDNA and cloned into 
a mammalian expression vector. 

Briefly, in order to construct the subclones, the 
Factor VII cDNA EcoRI fragment of XVII2463 was 
cloned into the EcoRI site of pUC18, and des- 
ignated pVII2463. Similarly, the EcoRI cDNA insert 
of XVII565 was subcloned into pUC18, and des- 
ignated pVII565. A hybrid between the 5 1 portion of 
the Factor VII sequence of clone pVII565 and the 3* 
segment of Factor VII DNA of pVII2463.was con- 
structed by cloning the 5'-most EcoRI-Bgl It Factor 
VII fragment of pVII565 and the Bgl ll-Hind til 
Factor VII fragment (Hind HI site in polylinker of 
pVII2463) of pVII2463 into pUC 18 digested with 
EcoRI and Hind III. This construct was designated 
pVII2397. The inserts of pVII2463 and pVII2397 
were removed by EcoRI digestion and gel purified 
for insertion into mammalian expression vectors as 
described below. 

A. Expression of full-length Factor VII cDNA. 

The expression of Factor VII was achieved in 
the vector pDX. This vector was derived from pD3 
(described in Example 8 above) and pD3\ a vector 
identical to pD3 except that the SV40 polyadenyla- 



26 



49 



0 200 421 



50 



tion signal (i.e. the SV40 Bam HI [2533 bp] to Bell 
[2770bp] fragment) is in the late orientation. Thus, 
pD3 f contains a Bam HI site as the site of gene 
insertion. 

To generate pDX, the EcoRI site in pD3' was 5 
converted to a Bel I site by Eco Rl cleavage, 
incubation with S1 nuclease, and subsequent liga- 
tion with Bel I linkers. DNA was prepared from a 
positively identified colony, and the 1.9 kb Xhol- 
Pstl fragment containing the altered restriction site w 
was prepared via agarose gel electrophoresis. In a 
second modification, Bel l-cleaved pD3 was Itgated 
with kinased Eco Rl-Bcl f adaptors (constructed 
from oligonucleotides ZC 525, 5 ' GGAATTCT 3 ; and 
ZC526, 5 GATCAGAATTCC 3 ) in order to generate 75 
an Eco Rl site as the position for inserting a gene 
into the expression vector. Positive colonies were 
identified by restriction endonuclease analysis, and 
DNA from this was used to isolate a 2.3 kb Xhol- 
Pstl fragment containing the modifed restriction 20 
site. The two above-described DNA fragments were 
incubated together with T4 DNA ligase, trans- 
formed into E. coN HB101 and positive colonies 
were identified by restriction analysis. A prepara- 
tion of such DNA, termed pDX, was made (Figure 25 
12). This DNA was cleaved with Eco Rl and subse- 
quently incubated with calf-intestinal phosphatase. 
The purified DNA was then incubated with T4 DNA 
ligase and the Factor VII Eco Rl fragment from 



TABLE 5 

ELISA Positive 



DNA 


Cell 
Line 


Cell 
Number 


Material (ng/ml 
Culture Medium) 


FVII(2463)/pDX 


COS 


2 x 10 6 


15 


FVII (565+2463 )/pDX 


COS 


2 x 10 6 


12 


Control 


COS 


2 x 10 6 


<2 


FVII (2463) /pDX 


BHK 


9 x 10 6 


62 


FVII ( 565+2463 )/pDX 


BHK 


9 x 10 6 


6 


Control 


BHK 


9 x 10 6 


<2 



pVII2463, or with the Factor VII Eco Rl cDNA 
fragment derived from pVII2397. The resultant 
clones were designated FVII(2463)/pDX and FVII- 
(565 + 2463)/pDX, respectively (Figure 12). After 
transformation into E. £gh JM83 and subsequent 
identification by restriction enzyme analysis, plas- 
mid DNA preparations were made and checked by 
extensive restriction endonuclease digestion. The 
plasmids FVII(2463)/pDX and FVII(565 + 2463)- 
/pDX have been deposited with American Type 
Culture Collection and have been assigned acces- 
sion numbers 40206; and 40205, respectively. 

FVII(2463)/pDX and FVII(565 + 2463)/pDX (10 
ug each) were each transfected, along with 10 ug 
salmon sperm carrier DNA, into either BHK tk*t!3 
cells (Floros et al., Exoer. Cell Res. 132 : 215-223, 
1981) or COS cells, using standard calcium-phos- 
phate precipitation. Following transfection, the cells 
were cultured in the appropriate media containing 5 
ug/ml vitamin K for two days. At this time, the 
supernatants were assayed for ELISA-positive ma- 
terial, using a monoclonal antibody directed against 
Factor VII. Both FVII(2463)/pDX and FVII - 
(565 + 2463)/pDX directed the production of Factor 
VII polypeptide which was detected in COS cell 
supernatants, and Factor VII from FVII(565 + 2463)- 
pDX was detected in BHK cell supernatant. Sham- 
transfected BHK ceils or COS cells did not yield 
detectable levels of Factor VII (Table 5). 



Transient expression of Factor Vtl and also 
tested in several other cell lines, listed in Table 6. 
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Name 

1. Rat Hep I 

2 . Rat Hep II 

3. TCMK 

4 . Human lung 



5 • Human 

hepatoma 

6. Hep G2 



7. Mouse liver 

8. COS 

9. BHK 

10. 293 

11. DUKX 



TABLE 6 
Description 

Rat hepatoma H4-II-E-C3 

Rat hepatoma H4-II-E 

Mouse Kidney, SV40 virus 
transformed , TCMK-1 

SV40 virus transformed 
WI-38 VA13, subline 2RA 

Human liver adenocarcinoma 
SK-HEP-1 

Human hepatoma, dev. by 
Barbara Knowles/Wistar 
Institute 

NCTC clone 1469 

SV40-transf ormed CV-1 
(monkey) cells 

Baby hamster kidney 
BHK-21 tC-13) 

Human embryonic 
Kidney/Ad transformed 

CH0-DHFR sens 



Reference 
(ATTC #) 

CRL 1600 

CRL 1548 

CCL 139 

CCL 75.1 

HTB-52 

HTB 806 5 

CC 29.1 
CRL 1650 

CCL 10 

CRL 1573 

(Urlaub & 
Chasin , 
PNAS (USA) 
72: 4216- 
4220, 1980) 



Cells were cotransfected with 10 ug of either 
FVII(2463)/pDX or FVI 1(565 + 2463)/pDX together 
with 1 ug of a plasmid comprising the chloram- 
phenicol acetyl transacetylase gene (to permit 



identification of cotransfected cells) and 10 ug of 
salmon sperm DNA. Mock transfected cells were 
used as controls. Spent media were assayed by 
ELISA after six days. Results are given in Table 7, 
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TABLE 7 






















Number 


ELISA 


Sample 


Cell Line 


Plasmid 


x 10 b 


( ng/ml ) 


1. 


Rat Hep I 


Mock 


11.6 


2.4 


2. 


Rat Hep I 


FVII (565+2463 )/pDX 


7.0 


2 


3. 


Rat Hep I 


FVII(2463)/pDX 


7.0 


<2 


4. 


Rat Hep 2 


Mock 


13.0 


<2 


5. 


Rat Hep 2 


FVII ( 565+2463 )/pDX 


18 .4 


<2 


6. 


Rat Hep 2 


FVII(2463)/pDX 


14.6 


<2 


7. 


TCMK 


Mock 


18 .8 


<2 


8. 


TCMK 


FVII ( 565+2463 )/pDX 


9.8 


<2 


9. 


TCMK 


FVII ( 2463 )/pDX 


12.8 


<2 


10 . 


Human Lung 


Mock 


7.2 


<2 


11. 


Human Lung 


FVII (565+2463 )/pDX 


3.4 


16 .5 
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12. 


Human 


Lung 


FVII(2463)/pDX 


3.4 


12 


.2 


13. 


Human 


Hepatoma 


Mock 


13.0 


<2 




14. 


Human 


Hepatoma 


FVIK 565+2463 )/pDX 


11.0 


3 


.5 


15. 


Human 


Hepatoma 


FVII(2463)/pDX 


6.0 


3 


.0 


16. 


HepG2 




Mock 


6.8 


21 




17. 


HepG2 




FVII (565+2463 )/pDX 


6.0 


45 


.5 


18. 


HepG2 




FVII (2463 ) /pDX 


6.0 


28 




19. 


Mouse 


Liver 


Mock 


3.8 


<2 




20 . 


Mouse 


Liver 


FVIK 565+2463 )/pDX 


4.0 


<2 




21. 


Mouse 


Liver 


FVII (2463 )/pDX 


3.6 


<2 




22. 


COS 




Mock 


5.6 


<2 




23 . 


• COS 




FVII (565+2463 )/pDX 


5.6 


15. 


5 


24. 


COS 




FVIK 2463 )/pDX 


4.4 


14. 


.5 


25. 


BHK tk 


~tl3 


Mock 


3.0 


<2 




26. 


BHK tk 


"tl3 


FVIK 565+2463 )/pDX 


5.0 


25 




27. 


BKH tk 


"tl3~ 


FVIK 2463 )/pDX 


4.0 


. 22. 


5 


28 . 


293 




Mock 


5.8 


<2 




29. 


293 




FVIK 565+2463 ) /pDX 


6.2 


94 




30 . 


293 




FVIK 2463 )/pDX 


8.2 


100 




31. 


DUKX 




Mock 


11 . 6 


<2 




32. 


DUKX 




FVIK 565+2463 )/pDX 


13.0 


<2 




33 . 


DUKX 




FVIK 2463 )/pDX 


13 .6 


<2 





FVII(2463)/pDX (10 ug) or FVII<565 + 2463)/pDX 
(10 ug) was co-transfected with 10 ug of salmon 
sperm DNA and 1 ug of a ptasmid encoding the 
resistant form of dihydrofolate reductase (Simonsen 
and Levinson, Proc. Natl. Acad. Sci. USA 80 : 
2495-2499, 1983) in a mammalian expression vec- 
tor, into BHK tk*t13 cells. After two days, the cells 
were split 1:14 and placed into selective media 
containing either 250 nM or 1000 nM methotrexate 
(MTX) and 5 ug/ml vitamin K (phytadione, Merck). 
After two weeks, colonies were isolated and grown 
to 50-90% confluency. The supernatant media 
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were then assayed for Factor VII polypeptide by 
ELISA. Of the 25 positive clones, 22 were analyzed 
further. The cells were plated at 5 x 10* (Group I) 
or 1 x 10 s (Group II) in 10 cm dishes containing 5 
ug/ml vitamin K, and either 250 nM or 1000 nM 
methotrexate. Five days later, the faster growing 
clones (designated by an asterisk in Table 8) were 
split 1:2, then after 24 hours, the media were 
changed on all plates. Twenty-three hours (Group 
I) or 20 hours (Group II) later, supernatant media 
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were harvested and cell counts were taken for each 
clone. The media were assayed both by ELISA and 
by the one-stage clotting assay. Results are shown 
in Table 8. 

5 



TABLE 8 
GROUP I - 23-hour assay 



Clone 


Cell 
Count 

Plasmid X10~ 5 


ELISA 

(pg/ 
cell/ 
day) 


ELISA 
(ng/ 
ml) 


Clot- 
ting 
(ng/ 
ml ) 


% 

Active 




PVII (565+2463 )/pt>X 


2 


6.5 


130 


206 


158 


B4-B1* 


FVIK 565+2463 )/pDX 


27 


1.9 


513 


360 


70 


B4-C1 


FVII (565+2463 )/pDX 


16 


2.5 


393 


480 


122 


B4-C2* 


FVII ( 565+2463 ) /pDX 


9 


<0.2 


<20 


21 




B4-C3 


FVII (565+2463 )/pDX 


52 


1.5 


800 


910 


-114 


B4-D1* 


FVII (565+2463 )/pDX 


27 


2.0 


553 


570 


103 


B4-D2* 


FVII (565+2463 )/pDX 


13 


1.2 


150 


154 


103 


B4-E1 


FVII ( 565+2463 ) /pDX 


39 


2.2 


870 


1160 


133 


B4-E2* 


FVII ( 565+2463 ) /pDX 


8 


2.5 


205 


240 


117 


B4-E3 


FVII ( 565+2463 ) /pDX 


23 


1.2 


275 


320 


116 


B4-E4 


FVII (565+2463 )/pDX 


31 


1.3 


410 


300 


73 


B3-5.3* 


FVII (2463) /pDX 


5 


8.2 


410 


290 


70 




GROUP II 


- 20-hour assay 






Clone 


Plasmid 


Cell 
Count 
X10 -5 


ELISA 

(pg/ 

cell/ 
day) 


ELISA 
(ng/ 
ml) 


Clot- 
ting 
(ng/ 
ml) 


% 

Active 


B3-2.2 


FVIK 2463 )/pDX 


41 


2.5 


1043 


500 


48 


B3-2.3 


FVII (2463) /pDX 


19 


3.0 


580 


610 


105 


B3-3.2 


FVIK 2463 )/pDX 


13 


1.5 


197 


216 


110 


B3-4.2 


FVIK 2463 )/pDX 


41 


1.8 


760 


620 


82 


B3-5.1 


FVIK2463)/pDX 


14 


3.3 


460 


400 


87 


B3-5.2 


FVIK 2463 )/pDX 


9 


2.7 


257 


184 


72 


B6-D 


FVIK 2463 )/pDX 


54 


3.0 


1700 


780 


46 


B6-E 


FVIK 2463 )/pDX 


101 


1.6 


1640 


970 


59 


B6-G 


FVII(2463)/pDX 


31 


5.7 


1853 


1080 


58 


B6-M 


FVIK2463)/pDX 


75 


2.2 


1743 


940 


54 
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B. Expression of Factor VIII genomic-cDNA hybrid. 

An expression vector containing genomic se- 5 
quences representing the Factor VII genomic 5' - 
terminus and cDNA sequences from the Factor VII 
gene 3'-terminus was prepared as follows. Three 
subclones of the genomic plasmid 7 ml were used 
to reconstruct the S'-terminus: 7Bam, 7SD and w 
7SE. Plasmid 7Bam is a 3.6 Kb EcoRI-Bam HI 
fragment containing exon 1a, subcloned into 
PUC12. A 0.7 Kb EcoRI-Xbal fragment, which con- 
tains exon 1a, was isolated from this subclone and 
is designated fragment a. Plasmid 7SD is a 3.7 Kb is 
Sstl fragment containing exon 1b, subcloned into 
pUC18. An exon 1 b-containing 3.1 Kb Xbal-Sstl 
fragment was isolated from this subclone and is 
designated fragment b. Plasmid 7SE is a 3.9 Kb 
Sstl fragment containing exons 2-4, subcloned into 
M13mp 19. An Sstl-Bgl II (0.6 Kb) fragment con- 
taining the 5* part of exon 2 was gel isolated and is 
designated fragment c. The remainder of the 3'- 
Factor VII cDNA (fragment d) was obtained as a 2 
Kb Bgl ll-EcoRI fragment from pUVII2463. Frag- 
ments a-d were ligated with EcoRI-cleaved and calf 
intestinal-phosphatased pDX, and then transformed 
into E. coH JM83 or HB101. Positive colonies were 
identified by restriction endonuclease analysis, and 
plasmid DNA was prepared from these colonies. 30 

For expression Factor VII, the plasmid DNA is 
co-transfected into BHK or COS cells as described 
above. Transfected cells are cultured in vitamin K- 
containing medium for 2 days, and the medium is 
assayed for Factor VII by ELISA. 35 

From the foregoing it will be appreciated that, 
although specific embodiments of the invention 
have been described herein for purposes of illustra- 
tion, various modifications may be made without 
deviating from the spirit and scope of the invention. 40 
Accordingly, the invention is not to be limited ex- 
cept by the appended claims. 

The various strains of E. Coli used in the 
foregoing Examples were the personal choice of 
the inventors. Persons skilled in this art will appre- 
ciate that other suitable strains of E. Coli could be 
substituted (for example if the person already has 
in his possession samples of other suitable E. Coli 
strains). 

Transformants ATCC 53067 and 53068 were so 
deposited with the American Type Culture Collec- 
tion on 28 March 1985 and plasmids ATCC 40205 
and 40206 were deposited with the American Type 
Culture Collection on 25 November 1985, all four 
deposits being made in accordance with the Bu- 55 
dapest Treaty. 



The features disclosed in the foregoing de- 
scription, in the following claims and/or in the ac- 
companying drawings may, both separately and in 
any combination thereof, be material for realising 
the invention in diverse forms thereof. 

Claims 



1. A DNA construct containing a nucleotide se- 
quence encoding a protein which upon activation 
has trwsame or substantially the same biological 
activity for blood coagulation as Factor Vila. 



2. The DNA construct of claim 1 wherein said 
nucleotide sequence codes at least partially for 
Factor VII, said nucleotide sequence comprising a 
first nucleotide sequence encoding a calcium bind- 
20 ing domain, joined to a second nucleotide se- 
quence positioned downstream of said first se- 
quence, said second sequence encoding a cata- 
lytic domain for the serine protease activity of 
Factor Vila, the joined sequences coding for a 
25 protein which upon activiation has the same or 
substantially the same biological activity for blood 
coagulation as Factor Vila. 



3. The DNA construct of claim 2 wherein said first 
nucleotide sequence is substantially that of a gene 
selected from the group consisting of genes encod- 
ing Factor VII, Factor IX. Factor X, Protein C, 
prothrombin, and Protein S. 

4. The DNA construct of claim 2 wherein said first 
nucleotide sequence also encodes a leader pep- 
tide. 

5. The DNA construct of claim 2 wherein said first 
nucleotide sequence includes a synthesized 
double-stranded oligonucleotide. 



6. The DNA construct of claim 5 wherein said 
synthesized double-stranded oligonucleotide codes 

45 for substantially the amino-terminal portion of Fac- 
tor VII. 

7. The DNA construct of claim 1 wherein said 
nucleotide sequence encodes Factor VII. 



8. The DNA construct of claim 7 wherein at least a 
portion of said nucleotide sequence is derived from 
a cDNA clone or a genomic clone of Factor VII. 

9. The DNA construct of claim 7 comprising a first 
nucleotide sequence derived from a cDNA or a 
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genomic clone of Factor VII, joined to a second 
nucleotide sequence positioned downstream of 
said first sequence, said second sequence derived 
from a cDNA clone of Factor VII, the joined se- 
quences coding for a protein which upon activation 
has the same or substantially the same biological 
activity for blood coagulation as Factor Vila. 

10. The DNA construct of claim 7 wherein said 
nucleotide sequence comprises the cDNA se- 
quence of Figure 1b, from bp 36 to bp 1433. 

11. The DNA construct of claim 7 wherein said 
nucleotide sequence comprises the cDNA se- 
quence of Figure 1b, from bp 36 to bp 99, followed 
downstream by the sequence from bp 166 to. bp 
1433. 

12. A recombinant plasmid capable of integration in 
mammalian host cell DNA, said plasmid including a 
promoter followed downstream by a nucleotide se- 
quence according to any of the previous claims 1- 
1 1 , said nucleotide sequence being followed down- 
stream by a polyadenylation signal. 

13. Mammalian cells transfected with a recom- 
binant plasmid according to claim 12. 

14. A method for producing a protein having bio- 
logical activity for blood coagulation mediated by 
Factor Vila, comprising: 

establishing a mammalian host cell which contains 
a DNA construct according to any of the previous 
claims 1-11; 

growing said mammalian host cell in an appropriate 
medium; 



isolating the protein product encoded by said DNA 
construct produced by said mammalian host cell; 
and 

s activating said protein product to generate a protein 
which has the same or substantially the same 
biological activity for blood coagulation as Factor 
Vila. 

70 15. The method of claim 14, including amplification 
of the DNA construct by cotransfection of the host 
cell with a gene encoding dihydrofolate reductase, 
wherein the appropriate medium comprises 
methotrexate. 

75 

16. The method of claim 14 wherein said protein 
product is activated by reacting the protein with a 
proteolytic enzyme selected from the group con- 
sisting of Factor Xlla, Factor IXa, kallikrein, Factor 

20 Xa, and thrombin. 

17. A pharmaceutical preparation for the treatment 
of bleeding disorders containing a protein prepared 
according to claim 14. 

25 

18. A method of producing a protein having biologi- 
cal activity for blood coagulation mediated by Fac- 
tor Vila, which process comprises growing in an 
appropriate medium an established mammalian 

.30 host cell which contains a DNA construct in accor- 
dance with any one of Claims 1 to 1 1 , isolating the 
protein product encoded by said DNA construct 
produced by said mammalian host cell and activat- 
ing said protein product to generate a protein 

35 which has the same or substantially the same 
biological activity for blood coagulation as Factor 
Vila. 
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FIG. 1A 



EcoRJa 24 39 54 

6AATTCC GG TCC ACC ACC AA6 CTC TTC TGC ATT TCT TAC AGT GAT GGG GAC CAG 
Arg Thr Lys Leu Phe Trp Me Ser Tyr Ser Asp Gly Asp Gin 

69 84 99 

TCT GCC TCA AGT CCA TGC CAG AAT GGG GGC TCC TGC AAG GAC CAG CTC CAG TCC 
Cys Ala Ser Ser Pro Cys Gin Asn Gly Gly Ser Cys Lys Asp Gin Leu Gin Ser 

114 129 144 159 

TAT ATC TGC TTC TGC CTC CCT GCC TTC GAG GGC CGG AAC TGT GAG ACG CAC AAG 
Tyr lie Cys Phe Cys Leu Pro Ala Phe Glu Gly Arg Asn Cys Glu Thr His Lys 

1 71* 189 20<i Pst la 

GAT GAC CAG CTG ATC TGT GTG AAC GAG AAC GGC GGC TGT GAG CAG TA C TGC AGT 
Asp Asp Gin Leu Me Cys Val Asn Glu Aso GJy Gly Cys Glu Gin Tyr Cys Ser 

219 234 249 264 

GAC CAC ACG GGC ACC AAG CGC TCC TGT CGG TGC CAC GAG GGG TAC TCT CTG CTG 

Asp His Thr Gly Thr Lys Arg Ser Cys Arg Cys His Glu Gly Tyr Ser Leu Leu 

279 294 309 324 

GCA GAC GGG GTG TCC TGC ACA CCC ACA GTT 'GAA TAT CCA TGT GGA AAA ATA CCT 
Ala Asp Gly Val Ser Cys Thr Pro Thr Val Glu Tyr Pro Cys Gly Lys lie Pro 

Xba I 339 354 369 

ATTCTAGAA AAA AGA AAT GCC AGC AAA CCC CAA GGC CGA ATT GTG GGG GGC AAG 
Me Leu Glu Lys Arg Asn Ala Ser Lys Pro Gin Gly Arg lie Val Gly Gly Lys 

384 399 414 429 

GTG TGC CCC AAA GGG GAG TGT CCA TGG CAG GTC CTG TTG TTG GTG AAT GGA GCT 
Val Cys Pro Lys Gly Glu Cys Pro Trp Gin Val Leu Leu Leu Val Asn Gly Ala 

444 459 474 

CAG TTG TGT GGG GGG ACC CTG ATC AAC ACC ATC TGG GTG GTC TCC GCG GCC CAC 
Gin Leu Cys Gly Gly Thr Leu Me Asn Thr Me Trp Val Val Ser Ala Ala His 

I189 504 519 534 

TGT TTC GAC AAA ATC AAG AAC TGG AGG AAC CTG ATC GCG GTG CTG GGC GAG CAC 

Cys Phe Asp Lys Me Lys Asn Trp Arg Asn Leu Me Ala Val Leu Gly Glu His 

51,9 564 579 594 

GAC CTC AGC GAG CAC GAC GGG GAT GAG CAG AGC CGG CGG GTG GCG CAG GTC ATC 
Asp Leu Ser Glu His Asp Gly Asp Glu Gin Ser Arg Arg Val Ala Gin Val Me 

609 Sma I 624 639 

ATC CCC AGC ACG TAC GT C CCG GG C ACC ACC AAC CAC GAC ATC GCG CTG CTC CGC 
Me Pro Ser Thr Tyr Val Pro Gly Thr Thr Asn His Asp Me Ala Leu Leu Arg 

651, 669 684 699 

CTG CAC CAG CCC GTG GTC CTC ACT GAC CAT GTG GTG CCC CTC TGC CTG CCC GAA 
Leu His Gin Pro Val Val Leu Thr Asp His Val Val Pro Leu Cys Leu Pro Glu 
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71* 729 744 

CCC ACC TTC TCT CAC ACC ACC CTG GCC TTC CTC CCC TTC TCA TTC CTC AGC CCC 
Arg Thr Phe Ser Glu Arg Thr Leu All Phe Val Arg Phe Ser Leu Val Ser Cly 

759 774 Nirl 789 804 

TGG GGC CAG CTG CTG QAC CGT GGC GCC ACC GCC CTG GAG CTC ATG GTC CTC AAC 

Trp Cly Gin Leu Leu Asp Arg Gly Ala Thr Ate Leu Glu Veu Het Val. -Leu Asn 

819 834 Psc lb 849 864 

GTG CCC CGG CTG ATG ACC CAG GAC TGC CTG CAG CAG TCA CGG AAG GTG GGA GAC 
Val Pro Arg Leu Net Thr Gin Asp Cys Leu Gin Gin Ser Arg Lys Val Gly Asp 

879 894 909 

TCC CCA AAT ATC ACG GAG TAC ATG TTC TGT GCC GGC TAC TCG GAT GGC AGC AAG 
Ser Pro Asn lie Thr Glu Tyr Met Phe Cys Ala Cly Tyr Ser Asp Gly Ser Lys 

924 939 954 969 

GAC TCC TGC AAG GGG GAC AGT GGA GGC CCA CAT GCC ACC CAC TAC CGG GGC ACG 
Asp Ser Cys Lys Gly Asp Ser Gly Gly Pro His Ala Thr His Tyr Arg Gly Thr 

984 999 1014 

TGG TAC CTG ACG GGC ATC GTC AGC TGG GGC CAG GGC TGC GCA ACC GTG GGC CAC 
Trp Tyr Leu Thr Gly lie Val Ser Trp Gly Gin Gly Cys Ala Thr Val Gly His 

1029 1044 1059 TaqI 1074 

TTT GGG GTG TAC ACC AGG GTC TCC CAG TAC ATC GAG TGG CTG CAA AAG CTC ATG 
Phe Gly Val Tyr Thr Arg Val Ser. Gin Tyr lie Glu Trp Leu Gin Lys Leu Met 

1089 1104 1119 1138 

CGC TCA GAG CCA CGC CCA GGA GTC CTC CTG CGA GCC CCA TTT CCC TAG CCCAGCAGCC 
Arg Ser Glu Pro Arg Pro Gly Val Leu Leu Arg Ala Pro Phe Pro 

Pstlc 

1148 1158 1168 1178 1188 1198 1208 

CTGGCCTGTG GAGAGAAAGC CAAGGCTGCG TCGAACTGTC CTGGCACCAA ATC CC AT AT A TTCTT CTGCA 



1218 1228 1238 1248 1258 1268 1278 

GTTAATGGGG TAGAGGAGGG CATGGGAGGG AGGGAGAGGT GGGGAGGGAG ACAGAGACAG AAACAGAGAG 



1288 1298 1308 1318 1328 1338 1348 

AG AC AG AG AC AGAGAGAGAC TGAGGGAGAG ACTCTGAGGA CCATGCACAG AGACTCAAAG AGACTCCAAG 



1358 1368 1378 1388 1398 1408 1438 

ATTCAAAGAG ACTAATAGAG ACACAGAGAT GGAATAGAAA AGATGAGAGG CAGAGGCAGA CAGGCGCTGG 



1428 1438 1448 1458 1468 1478 1488 

ACAGAGGGGC AGGGGAGTGC CAAGGTTGTC CTGGAGGCAG ACAGCCCAGC TGAGCCTCCT TACCTCCCTT 
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1498 1508 1518 1528 1538 1548 1558 

CACCCAACCC CCACCTCCAC CT6ATCTCC7 GCCCCTCACG CTCCTCCTCT GCCTTCATTC CTCCACACAC 

1568 1578 1588 1598 1608 1618 1628 

TACACGCATC ACACACATCC ATCCACACAC ACACACCCCA TCCACACACA CACACATATC CACACACACC 

I638 1648 1658 1668 1678 1688 I698 

CATCCACACA CACATCCTCA CACACACTAC CCAAACACAC CCATCCACAC CCACATACAC ATATCCACAC 

1708 1718 1728 1738 1748 1758 1768 

ACAGATGCAC ACACAGATAT ACACATGGAG TGCACGCACA TGCCAATGCA CGCACACATC AGTCCACACG 

177 8 1788 1798 1808 1818 1828 I838 

GATGCACAGA GATATGCACA CACCGATGTG CGCACACACA GATATGCACA CACATGGATG AGCACACACA 

,848 1858 1868 I878 1888 I898 1908 

CACCAAGTGC GCACACACAC CGATGTACAC ACAGATGCAC ACACAGATGC ACACACACCG ATGCTGACTC 

19 18 1928 1938 1948 1958 1968 1978 

CATGTGTGCT GTCCTCTGAA GGCGGTTCTT TAGCTCTCAC TTTTCTGCTT CTTATCCATT ATCATCTTCA 

1988 1998 2008 2018 2028 2038 2048 

CTTCAGACAA TTCAGAAGCA TCACCATGCA TGGTGGCGAA TGCCCCCAAA CTCTCCCCCA AATGTATTTC 

2058 2068 2078 2088 2098 2108 2118 

TCCCTTCGCT GGGTGCCGGG CTGCACAGAC TATTCCCCAC CTGCTTCCCA GCTTCACAAT AAACGGCTGC 



2128 2138 2148 2158 2168 EcoRIb 

GTCTCCTCGC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAGGAATTC 
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FIG. IB -so 

MetValSerGlnAlaLeuArgLeuLeu 
TCAACAGGCAGGGGCAGCACTGCAGAGATTTCATCATGGTCTCCCAGGCCCTCAGGCTCCTC 
10 20 30 40 50 60 

-50 -40 i 

CysLeuLeuLeuGlyLeuGlnGlyCysLeuAlaAlaGlyGlyValAlaLysAlaSerGlyGly 
TGCCTTCTGCTTGGGCTTCAGGGCTGCCTGGCTGCAGGCGGGGTCGCTAAGGCCTCAGGAGG A 
70 80 90 100 ll6 120 

-30 -20 i 

GluThrArgAspMetProTrpLysProGlyProHi3ArgValPheValThrGlnGluGlu 
GAAACACGGGACATGCCGTGGAAGCCGGGGCCTCACAGAGTCTTCGT AACCCAGGAGGAA 
130 140 150 160 170 180 

-10 -1+1 +10 

AlaHisGlyValLeuHisArgArgArgArgAlaAsnAlaPheLeuGluGluLeuArgPro 
GCCCACGGCGTCCTGCACCGGCGCCGGCGCGCCAACGCGTTCCTGGAGGAGCTGCGGCCG 
190 200 210 220 230 240 

+20 +30 
GlySerLeuGluArgGluCysLysGluGluGlnCysSerPheGluGluAlaArgGluIle 
GGCTCCCTGGAGAGGGAGTGCAAGGAGGAGCAGTGCTCCTTCGAGGAGGCCCGGG AGATC 
250 260 270 280 290 300 

+40 +50 
PheLysAspAlaGluArgThrLysLeuPheTrpIleSerTyrSerAspGlyAspGlnCys 
TTCAAGGACGCGGAGAGGACGAAGCTGTTCTGGATTTCTT ACAGTGATGGGGACCAGTGT 
310 320 330 340 350 360 

+60 +70 
AlaSerSerProCysGlnAsnGlyGlySerCysLysAspGlnLeuGlnSerTyrlleCys 
GCCTCAAGTCCATGCCAGAATGGGGGCTCCTGCAAGGACCAGCTCCAGTCCTATATCTGC 
370 380 390 400 410 420 

+80 +90 
PheCysLeuProAlaPheGluGlyArgAsnCysGluThrHisLysAspAspGlnLeuIle 
TTCTGCCTCCCTGCCTTCGAGGGCCGGAACTGTGAGACGCACAAGGATGACCAGCTGATC 
430 440 450 460 470 480 

+100 +110 
CysValAsnGluAsnGlyGlyCysGluGlnTyrCysSerAspHisThrGlyThrLysArg 
TGTGTGAACGAGAACGGCGGCTGTGAGCAGTACTGCAGTGACCACACGGGCACCAAGCGC 
490 500 510 520 530 540 

+120 +130 
SerCysArgCysHisGluGlyTyr SerLeuLeuAlaAspGlyValSerCysThrProThr 
TCCTGTCGGTGCCACGAGGGGTACTCTCTGCTGGCAGACGGGGTGTCCTGCACACCCACA 
550 560 570 580 590 600 

+140 +150 
ValGluTyrProCysGlyLysIleProlleLeuGluLysArgAsnAlaSerLysProGln 
GTTGAATATCCATGTGGAAAAATACCTATTCTAGAAAAAAGAAATGCCAGCAAACCCCAA 
610 620 630 640 650 660 
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+160 +170 
GlyArglleValGlyGlyLysValCysProLysGlyGluCysProTrpGlnValLeuLeu 
GGCCGAATTGTGGGGGGCAAGGTGTGCCCCAAAGGGGAGTGTCCATGGCAGGTCCTGTTG 
670 680 690 700 710 720 

+180 +190 
LeuValAsnGlyAlaGlnLeuCysGlyGlyThrLeuIleAsnThrlleTrpValValSer 
TTGGTGAATGGAGCTCAGTTGTGTGGGGGGACCCTGATCAACACCATCTGGGTGGTCTCC 
730 740 750 760 770 780 

+200 +210 
AlaAlaHisCysPheAspLysIleLysAsnTrpArgAsnLeuTleAlaValLeuGlyGlu 
GCGGCCCACTGTTTCGACAAAATCAAGAACTGGAGGAACCTGATCGCGGTGCTGGGCGAG 
790 880 810 820 830 840 

+220 +230 
HisAspLeuSerGluHisAspGlyAspGluGlnSerArgArgValAlaGlnValTlelle 
CACGACCTCAGCGAGCACGACGGGGATGAGCAGAGCCGGCGGGTGGCGCAGGTCATCATC 
850 860 870 880 890 900 

+240 +250 
ProSerThrTyrValProGlyThrThrAsnHisAspIleAlaLeuLeuArgLeuHisGln 
CCCAGCACGT ACGTCCCGGGCACCACCAACCACGACATCGCGCTGCTCCGCCTGCACCAG 
910 920 930 940 950 960 

+260 +270 
ProValValLeuThrAspHisValValProLeuCysLeuProGluArgThrPheSerGlu 
CCCGTGGTCCTCACTGACCATGTGGTGCCCCTCTGCCTGCCCGAACGGACGTTCTCTGAG 
970 980 990 1000 1010 1020 

+280 _ +290 

ArgThrLeuAlaPheValArgPheSerLeuValSerGlyTrpGlyGlnLeuLeuAspArg 
AGGACGCTGGCCTTCGTGCGCTTCTCATTGGTCAGCGGCTGGGGCCAGCTGCTGGACCGT 
1030 1040 1050 1060 1070 1080 

+300 +310 
GlyAlaThrAlaLeuGluLeuMetValLeuAsnValProArgLeuMetThrGlnAspCys 
GGCGCCACGGCCCTGGAGCTCATGGTCCTCAACGTGCCCCGGCTGATGACCCAGGACTGC 
1090 1100 1110 1120 1130 1140 

+320 +330 
LeuGlnGlnSerArgLysValGlyAspSerProAsnlleThrGluTyrMetPheCysAla 
CTGCAGCAGTCACGGAAGGTGGGAGACTCCCCAAATATCACGGAGTACATGTTCTGTGCC 
1150 1160 1170 1180 1190 1200 

+340 +350 
GlyTyr SerAspGlySerLysAspSerCysLysGlyAspSerGlyGlyProHisAlaThr 
GGCTACTCGGATGGCAGCAAGGACTCCTGCAAGGGGGACAGTGGAGGCCCACATGCCACC 
1210 1220 1230 1240 1250 1260 

+360 +370 
HisTyr ArgGlyThrTrpTyrLeuThrGlylleValSerTrpGlyGlnGlyCysAlaThr 
CACT ACCGGGGCACGTGGT ACCTGACGGGCATCGTCAGCTGGGGCCAGGGCTGCGCAACC 
1270 1280 1290 1300 1310 1320 
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+380 +390 
ValGlyHisPheGlyValTyrThrArgValSerGlnTyrlleGluTrpLeuGlnLysLeu 
GTGGGCCACTTTGGGGTGT ACACCAGGGTCTCCCAGTACATCGAGTGGCTGCAAAAGCTC 
1330 1340 1350 1360 1370 1380 

+400 +406 
Met ArgSerGluProArgProGlyValLeuLeuArgAlaProPhePro*** 
ATGCGCTCAGAGCCACGCCCAGGAGTCCTCCTGCGAGCCCCATTTCCCTAGCCCAGCAGC 
1390 1400 1410 1420 1430 1440 

CCTGGCCTGTGG AG AG AA AGCC AAGGCTGCGT CG AACTGT CCTGGC ACC AAATCCCAT AT 
1450 1460 1470 1480 1490 1500 

ATTCTTCTGC AGTT AATGGGGT AG AGG AGGGCATGGG AGGG AGGG AG AGGTGGGG AGGG A 
1510 1520 1530 1540 1550 1560 

G AC AG AG AC AG A A AC AG AG AG AG AC AG AG AC AG AG AG AG ACTG AGGG AG AG ACT CTG AGG 
1570 1580 1590 1600 1610 1620 

AC ATGG AG AG AG ACT C AAAG AG ACT CC AAG ATTC A AAG AG ACT A AT AG AG AC AC AG AG AT 
1630 1640 1650 1660 1670 1680 

GGAATAGAAAAGATGAGAGGCAGAGGCAGACAGGCGCTGGACAGAGGGGCAGGGGAGTGC 
1690 1700 1710 1720 1730 1740 

CAAGGTTGTCCTGGAGGCAGACAGCCCAGCTGAGCCTCCTTACCTCCCTTCAGCCAAGCC 
1750 1760 1770 1780 1790 1800 

CCACCTGCACGTGATCTGCTGGCCCTCAGGCTGCTGCTCTGCCTTCATTGCTGGAGACAG 
1810 1820 1830 1840 1850 1860 

TAGAGGCATGAACACACATGGATGCACACACACACACGCCAATGCACACACACAGAGATA 
1870 1880 1890 1900 1910 ~ 1920 

TGCACACACACGGATGCACACACAGATGGTCACACAGAGATACGCAAACACACCGATGCA 
1930 1940 1950 1960 1970 1980 

CACGCACATAGAGATATGCACACACAGATGCACACACAGATATACACATGGATGCACGCA 
1990 2000 2010 2020 2030 2040 

CATGCCAATGCACGCACACATCAGTGCACACGGATGCACAGAGATATGCACACACCGATG 
2050 2060 2070 2080 2090 2100 

TGCGCACACACAGATATGCACACACATGGATGAGCACACACACACCAAGTGCGCACACAC 
2110 2120 2130 2140 2150 2160 

ACCGATGT ACACACACAGATGCACACACAGATGCACACACACCGATGCTGACTCCATGTG 
2170 2180 2190 2200 2210 2220 

TGCTGTCCTCTGAAGGCGGTTGTTTAGCTCTCACTTTTCTGGTTCTTATCCATTATCATC 
2230 2240 2250 2260 2270 2280 

TTCACTTCAGACAATTCAGAAGCATCACCATGCATGGTGGCGAATGCCCCCAAACTCTCC 
2290 2300 2310 2320 2330 2340 



CCCAAATGT ATTTCTCCCTTCGCTGGGTGCCGGGCTGCACAGACTATTCCCCACCTGCTT 
2350 2360 2370 2380 2390 2400 
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CCCAGCTTCACAATAAACGGCTGCGTCTCCTCCGCACACCTGTGGTGCCTGCCACCCAAA 
2410 2420 2430 2240 2450 2460 

AAAAAAAAAAAAAAAAAA 
2470 2480 
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FIG. 7 36 

GGATCC ATG CAG CGC GTG AAC ATG ATC ATG GCA GAA TCA CCA GGC 
MET Gin Arg Val Asn MET lie MET Ala Glu Ser Pro Gly 

66 81 
CTC ATC ACC ATC TGC CTT TTA GGA TAT CTA CTC AGT GCT GAA TGT 
Leu lie Thr lie Cys Leu Leu Gly Tyr Leu Leu Ser Ala Glu Cys 

96 111 126 

ACA GTT TTT CTT GAT CAT GAA AAC GCC AAC AAA ATT CTG AAT CGG 
Thr Val Phe Leu Asp His Glu Asn Ala Asn Lys He Leu Asn Arg 

141 156 171 

CCA AAG AGG TAT AAT TCA GGT AAA TTG GAA GAG TTT GTT CAA GGG 
Pro Lys Arg Tyr Asn Ser Gly Lys Leu Glu Glu Phe Val Gin Gly 

186 201 216 

AAC CTT GAG AGA GAA TGT ATG GAA GAA AAG TGT AGT TTT GAA GAA 
Asn Leu Glu Arg Glu Cys MET Glu Glu Lys Cys Ser Phe Glu Glu 

231 246 261 

GCA CGA GAA GTT TTT GAA AAC ACT GAA AGA ACA AAG CTG TTC TGG 
Ala Arg Glu Val Phe Glu Asn Thr Glu Arg Thr Lys Leu Phe Trp 

276 291 306 

ATT TCT TAC AGT GAT GGG GAC CAG TGT GCC TCA AGT CCA TGC CAG 
He Ser Tyr Ser Asp Gly Asp Gin Cys Ala Ser Ser Pro Cys Gin 

321 336 351 

AAT GGG GGC TCC TGC AAG GAC CAG CTC CAG TCC TAT ATC TGC TTC 
Asn Gly Gly Ser Cys Lys Asp Gin Leu Gin Ser Tyr He Cys Phe 

366 381 . 396 

TGC CTC CCT GCC TTC GAG GGC CGG AAC TGT GAG ACG CAC AAG GAT 
Cys Leu Pro Ala Phe Glu Gly Arg Asn Cys Glu Thr His Lys Asp 

411 426 441 

GAC CAG CTG ATC TGT GTG AAC GAG AAC GGC GGC TGT GAG CAG TAC 
Asp Glu Leu He Cys Val Asn Glu Asn Gly Gly Cys Glu Gin Tyr 

456 471 486 

TGC AGT GAC CAC ACG GGC ACC AAG CGC TCC TGT CGG TGC CAC GAG 
Cys Ser Asp His Thr Gly Thr Lys Arg Ser Cys Arg Cys His Glu 

501 516 531 

GGG TAC TCT CTG CTG GCA GAC GGG GTG TCC TGC ACA CCC ACA GTT 
Gly Tyr Ser Leu Leu Ala Asp Gly Val Ser Cys Thr Pro Thr Val 

546 561 576 

GAA TAT CCA TCT GGA AAA ATA CCT ATT CTA GAA AAA AGA AAT GCC 
Glu Tyr Pro Cys Gly Lys He Pro He Leu Glu Lys Arg Asn Ala 

591 606 621 

AGC AAA CCC CAA GGC CGA ATT GTG GGG GGC AAG GTG TGC CCC AAA 
Ser Lys Pro Gin Gly Arg He Val Gly Gly Lys Val Cys Pro Lys 
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636 

GGG GAG TGT CCA TGG 
Gly Glu Cys Pro Trp 

681 

TTG TGT GGG GGG ACC 
Leu Cys Gly Gly Thr 

726 

GCC CAC TGT TTC GAC 
Ala His Cys Phe Asp 

771 

GTG CTG GGC GAG CAC 
Val Leu Gly Glu His 

816 

AGC CGG CGG GTG GCG 
Ser Arg Arg Val Ala 

861 

GGC ACC ACC AAC CAC 
Gly TKr Thr Asn His 

906 

GTG GTC CTC ACT GAC 
Val Val Leu Thr Asp 

951 

ACG TTC TCT GAG AGG 
Thr Phe Ser Glu Arg 

996 

AGC GGC TGG GGC CAG 
Ser Gly Trp Gly Gin 

1041 

CTC ATG GTC CTC AAC 
Leu MET Val Leu Asn 

1086 

CAG CAG TCA CGG AAG 
Gin Gin Ser Arg Lys 

1131 

ATG TTC TGT GCC GGC 
MET Phe Cys Ala Gly 

1176 

GGG GAC AGT GGA GGC 
Gly Asp Ser Gly Gly 



651 

CAG GTC CTG TTG TTG 
Gin Val Leu Leu Leu 

696 

CTG ATC AAC ACC ATC 
Leu He Asn Thr He 

741 

AAA ATC AAG AAC TGG 
Lys He Lys Asn Trp 

786 

GAC CTC AGC GAG CAC 
Asp Leu Ser Glu His 

831 

CAG GTC ATC ATC CCC 
Gin Val He He Pro 

876 

GAC ATC GCG CTG CTC 
Asp He Ala Leu Leu 

921 

CAT GTG GTG CCC CTC 
His Val Val Pro Leu 

966~ 

ACG CTG GCC TTC GTG 
Thr Leu Ala Phe Val 

1011 

CTG CTG GAC CGT GGC 
Leu Leu Asp Arg Gly 

1056 

GTG CCC CGG CTG ATG 
Val Pro Arg Leu MET 

1101 

GTG GGA GAC TCC CCA 
Val Gly Asp Ser Pro 

1146 

TAC TCG GAT GGC AGC 
Tyr Ser Asp Gly Ser 

1191 

CCA CAT GCC ACC CAC 
Pro His Ala Thr His 



666 

GTG AAT GGA GCT CAG 
Val Asn Gly Ala Gin 

711 

TGG GTG GTC TCC GCG 
Trp Val Val Ser Ala 

756 

AGG AAC CTG ATC GCG 
Arg Asn Leu He Ala 

801 

GAC GGG GAT GAG CAG 
Asp Gly Asp Glu Gin 

846 

AGC ACG TAC GTC CCG 
Ser Thr Tyr Val Pro 

891 

CGC CTG CAC CAG CCC 
Arg Leu His Gin Pro 

936 

TGC CTG CCC GAA CGG 
Cys Leu Pro Glu Arg 

981 

CGC TTC TCA TTG GTC 
Arg Phe Ser Leu Val 

1026 

GCC ACG GCC CTG GAG 
Ala Thr Ala Leu Glu 

1071 

ACC CAG GAC TGC CTG 
Thr Gin Asp Cys Leu 

1116 

AAT ATC ACG GAG TAC 
Asn lie Thr Glu Tyr 

1161 

AAG GAC TCC TGC AAG 
Lys Asp Ser Cys Lys 

1206 

TAC CGG GGC ACG TGG 
Tyr Arg Gly Thr Trp 



1221 1236 1251 

TAC CTG ACG GGC ATC GTC AGC TGG GGC CAG GGC TGC GCA ACC GTG 
Tyr Leu Thr gly lie Val Ser Trp Gly Gin Gly Cys Ala Thr Val 



♦ 
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1266 1261 1296 

GGC CAC TTT GGG GTG TAC ACC AGG GTC TCC CAG TAC ATC GAG TGG 
Gly his Phe Gly Val Tyr Thr Arg Val Ser Gin Tyr lie Glu Trp 

1311 1326 1341 

CTG CAA AAG CTC ATG CGC TCA GAG CCA CGC CCA GGA GTC CTC CTG 
Leu Gin Lys Leu MET Arg Ser Glu Pro Arg Pro Gly Val Leu Leu 

1356 1378 1388 1398 

CGA GCC CCA TTT CCC TAG CCCAGCAGCC CTGGCCTGTG GAGAGAAAGC 
Arg Ala Pro Phe Pro 

1408 1418 1428 1438 1448 

CAAGGCTGCG TCGAACTGTC CTGGCACCAA ATCCCATATA TTCTTCTGCA 



1458 1468 1478 1488 1498 

GTTAATGGGG TAGAGGAGGG CATGGGAGGG AGGGAGAGGT GGGGAGGGAG 



1508 1518 1528 1538 1548 

A CA GAGA CAG AAACAGAGAG AGACAGAGAC AGAGAGAGAC TGAGGGAGAG 



1558 1568 1578 1588 1598 

ACTCTGAGGA CCATGGAGAG AGACTCAAAG AGACTCCAAG ATTCAAAGAG 

1608 1618 1628 16*38 1648 

ACTAATAGAG ACACAGAGAT GGAATAGAAA AGATGaGAGG CAGAGGCAGA 



1658 1668 1678 1688 1698 

CAGGCGCTGG ACAGAGGGGC AGGGGAGTGC CAAGGTTGTC CTGGAGGCAG 



1708 1718 1728 1738 1748 

ACAGCCCAGC TGAGCCTCCT TACCTCCCTT CAGCCAAGCC CCACCTGCAC 



1758 1768 1778 1788 1798 

GTGATCTGCT GGCCCTCAGG CTGCTGCTCT GCCTTCATTG CTGGAGACAG 



1808 1818 1828 1838 1848 

TAGAGGCATG ACACACATGG ATGCACACAC ACACACGCCA TGCACACACA 



1858 1868 1878 1888 1898 

CAGAGATATG CACACACACG GATGCACACA CAGATGGTCA CACAGAGTAC 



1908 1918 1928 1938 1948 

GCAAACACAC CGATGCACAC GCACATAGAG ATATGCACAC ACAGATGCAC 
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1958 1968 1978 1988 1998 

ACACAGATAT ACACATGGAG TGCACGCACA TGCCAATGCA CGCACACATC 

2008 2018 2028 2038 2048 

AGTGCACACG GATGCACAGA GATATGCACA CACCGATGTG CGCACACACA 

2058 2068 2078 2088 2098 

GATATGCACA CACATGGATG AGCACACACA CACCAAGTGC GCACACACAC 

2108 2118 2128 2138 2148 

CGATGTACAC ACAGATGCAC ACACAGATGC ACACACACCG ATGCTGACTC 

2158 2168 2178 2188 2198 

CATGTGTGCT GTCCTCTGAA GGCGGTTGTT TAGCTCTCAC TTTTCTGGTT 

2208 2218 2228 2238 2248 

. CTTATCCATT ATCATCTTCA CTTCAGACAA TTCAGAAGCA TCACCATGCA 

2258 2268 2278 2288 2298 

TGGTGGCGAA TGCCCCCAAA CTCTCCCCCA AATGTATTTC TCCCTTCGC1 

2308 2318 2328 2338 2348 

GGGTGCCGGG CTGCACAGAC TATTCCCCAC CTGCTTCCCA GCTTCACAAT 



2358 2368 2378 2388 2398 

AAACGGCTGC GTCTCCTCGC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



2408 
AAAAAAAAAA 



2418 
AAGGAATTCG 



2428 2438 
AGCTCGGTAC CCGGGGATCC 
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